NEWAgents can now see video via MCP.Try it now →

    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 7396 of 300 models

    Image To Text

    mlx-community/GLM-OCR-bf16

    4K
    25
    transformers
    Image To Text

    opendatalab/MinerU-Diffusion-V1-0320-2.5B

    4K
    22
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv3_mobile_det

    3K
    PaddleOCR
    Image To Text

    fxmarty/pix2struct-tiny-random

    3K
    2
    transformers
    Image To Text

    xtuner/llava-llama-3-8b-v1_1-gguf

    3K
    226
    Image To Text

    hezarai/crnn-base-fa-v2

    3K
    9
    hezar
    Image To Text

    kazars24/trocr-base-handwritten-ru

    3K
    16
    transformers
    Image To Text

    naver-clova-ix/donut-base-finetuned-rvlcdip

    3K
    20
    transformers
    Image To Text

    PaddlePaddle/PP-LCNet_x0_25_textline_ori

    3K
    1
    PaddleOCR
    Image To Text

    mradermacher/dots.ocr-i1-GGUF

    3K
    transformers
    Image To Text

    unsloth/GLM-OCR

    3K
    31
    transformers
    Image To Text

    noctrex/PaddleOCR-VL-1.5-GGUF

    3K
    7
    Image To Text

    mradermacher/HunyuanOCR-i1-GGUF

    3K
    1
    transformers
    Image To Text

    google/pix2struct-base

    3K
    79
    transformers
    Image To Text

    microsoft/trocr-base-str

    3K
    6
    transformers
    Image To Text

    PaddlePaddle/arabic_PP-OCRv5_mobile_rec

    3K
    3
    PaddleOCR
    Image To Text

    PaddlePaddle/PP-DocLayout-L

    3K
    5
    PaddleOCR
    Image To Text

    IAMJB/chexpert-mimic-cxr-findings-baseline

    2K
    2
    transformers
    Image To Text

    IAMJB/chexpert-mimic-cxr-impression-baseline

    2K
    transformers
    Image To Text

    KuroTo4ka/Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF

    2K
    2
    gguf
    Image To Text

    raxtemur/trocr-base-ru

    2K
    30
    transformers
    Image To Text

    nvidia/nemotron-ocr-v2

    2K
    150
    Image To Text

    xtuner/llava-phi-3-mini-gguf

    2K
    137
    Image To Text

    PaddlePaddle/PP-OCRv4_server_det

    2K
    1
    PaddleOCR
    4 / 13