    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 97–120 of 300 models

    Image To Text

    cyberagent/llava-calm2-siglip

    2K
    26
    transformers
    Image To Text

    qualcomm/TrOCR

    2K
    15
    pytorch
    Image To Text

    breezedeus/pix2text-mfr-1.5

    2K
    transformers
    Image To Text

    EasyDeL/gemma-4-31B-it

    2K
    easydel
    Image To Text

    PaddlePaddle/en_PP-OCRv3_mobile_rec

    2K
    PaddleOCR
    Image To Text

    PaddlePaddle/SLANeXt_wireless

    2K
    PaddleOCR
    Image To Text

    EZCon/GLM-OCR-4bit-g32-mxfp4-mixed_4_8-mlx

    2K
    5
    mlx
    Image To Text

    kishlay9890/chandra-gptq-4bit

    2K
    Image To Text

    PaddlePaddle/th_PP-OCRv5_mobile_rec

    2K
    2
    PaddleOCR
    Image To Text

    openthaigpt/thai-trocr

    2K
    25
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv4_server_rec_doc

    2K
    1
    PaddleOCR
    Image To Text

    mradermacher/Gliese-OCR-7B-Post2.0-final-i1-GGUF

    2K
    1
    transformers
    Image To Text

    mradermacher/Hulu-Med-Flash-Preview-27B-GGUF

    1K
    transformers
    Image To Text

    unography/blip-large-long-cap

    1K
    5
    transformers
    Image To Text

    thwri/CogFlorence-2.2-Large

    1K
    44
    transformers
    Image To Text

    sbintuitions/sarashina2.2-ocr

    1K
    26
    transformers
    Image To Text

    mayocream/manga-ocr

    1K
    2
    Image To Text

    PaddlePaddle/devanagari_PP-OCRv5_mobile_rec

    1K
    PaddleOCR
    Image To Text

    mradermacher/dots.ocr-GGUF

    1K
    1
    transformers
    Image To Text

    noamrot/FuseCap_Image_Captioning

    1K
    23
    transformers
    Image To Text

    PaddlePaddle/latin_PP-OCRv3_mobile_rec

    1K
    PaddleOCR
    Image To Text

    PaddlePaddle/PP-OCRv4_server_rec

    1K
    1
    PaddleOCR
    Image To Text

    mradermacher/WR30a-Deep-7B-0711-i1-GGUF

    1K
    1
    transformers
    Image To Text

    PaddlePaddle/cyrillic_PP-OCRv5_mobile_rec

    1K
    PaddleOCR
    Page 5 of 13