NEWAgents can now see video via MCP.Try it now →

    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 193216 of 300 models

    Image To Text

    Mungert/LightOnOCR-1B-1025-GGUF

    412
    1
    vllm
    Image To Text

    mradermacher/old-church-slavonic-dots-ocr-GGUF

    412
    transformers
    Image To Text

    chatpig/llava-llama3

    411
    4
    Image To Text

    SamMikaelson/deepseek-ocr-qvlm-4bit

    410
    transformers
    Image To Text

    nvidia/nemotron-ocr-v1

    405
    118
    Image To Text

    PaddlePaddle/japan_PP-OCRv3_mobile_rec

    400
    PaddleOCR
    Image To Text

    a0a7/gregg-recognition

    395
    2
    pytorch
    Image To Text

    PaddlePaddle/cyrillic_PP-OCRv3_mobile_rec

    392
    PaddleOCR
    Image To Text

    nyu-visionx/Cambrian-S-0.5B

    381
    2
    transformers
    Image To Text

    PaddlePaddle/RT-DETR-H_layout_3cls

    380
    PaddleOCR
    Image To Text

    EZCon/GLM-OCR-4bit-mlx

    378
    mlx
    Image To Text

    Hyphonical/Pixtral-12B-Captioner-Relaxed-Q4_K_M-GGUF

    375
    1
    transformers
    Image To Text

    samuraieng/sarashina2.2-vision-3b-gguf

    375
    Image To Text

    kkatiz/THAI-BLIP-2

    372
    8
    transformers
    Image To Text

    mrrtmob/kiri-ocr

    361
    9
    kiri-ocr
    Image To Text

    mradermacher/Qwen2.5-VL-3B-Abliterated-Caption-it-GGUF

    357
    2
    transformers
    Image To Text

    tuman/vit-rugpt2-image-captioning

    353
    13
    transformers
    Image To Text

    noctrex/LightOnOCR-2-1B-bbox-GGUF

    350
    Image To Text

    IDEA-CCNL/Taiyi-BLIP-750M-Chinese

    339
    15
    transformers
    Image To Text

    StanfordAIMI/CheXagent-2-3b-srrg-findings

    337
    1
    transformers
    Image To Text

    cnmoro/tiny-image-captioning

    332
    3
    transformers
    Image To Text

    InternScience/StructTable-InternVL2-1B

    322
    43
    Image To Text

    katanaml-org/invoices-donut-model-v1

    319
    40
    transformers
    Image To Text

    manu02/LAnA-v5

    319
    transformers
    9 / 13