NEWAgents can now see video via MCP.Try it now →

    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 121144 of 300 models

    Image To Text

    mradermacher/Qwen2-VL-2B-Abliterated-Caption-it-GGUF

    1K
    2
    transformers
    Image To Text

    ddobokki/ko-trocr

    1K
    33
    transformers
    Image To Text

    fhswf/TrOCR_german_handwritten

    1K
    13
    transformers
    Image To Text

    mradermacher/Qwen3.5-4B-Base-ZitGen-V1-GGUF

    1K
    transformers
    Image To Text

    microsoft/git-large-textcaps

    1K
    31
    transformers
    Image To Text

    kpyu/video-blip-opt-2.7b-ego4d

    988
    20
    transformers
    Image To Text

    mradermacher/Fast-PaddleOCR-VL-1.5-i1-GGUF

    961
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv4_server_seal_det

    959
    1
    PaddleOCR
    Image To Text

    microsoft/trocr-large-stage1

    911
    27
    transformers
    Image To Text

    mradermacher/Hulu-Med-235A22-i1-GGUF

    909
    1
    transformers
    Image To Text

    xtuner/llava-phi-3-mini-hf

    902
    53
    transformers
    Image To Text

    antoniorv6/smt-grandstaff

    891
    6
    Image To Text

    noctrex/Chandra-OCR-GGUF

    888
    15
    Image To Text

    Z3NN001/gemma4-21b-a4b-REAP-it-mlx-Q4

    885
    4
    mlx
    Image To Text

    ShayanCyan/phi4-multimodal-quantisized-gguf

    868
    6
    other
    Image To Text

    mradermacher/QwenStoryteller-GGUF

    856
    transformers
    Image To Text

    breezedeus/pix2text-mfd-1.5

    855
    Image To Text

    Abiray/Qianfan-OCR-GGUF

    835
    1
    gguf
    Image To Text

    mlx-community/GLM-OCR-4bit

    828
    4
    transformers
    Image To Text

    noctrex/LightOnOCR-1B-1025-GGUF

    817
    3
    Image To Text

    mradermacher/Nanonets-OCR2-3B-GGUF

    786
    15
    transformers
    Image To Text

    sbintuitions/sarashina2.2-vision-3b

    775
    17
    transformers
    Image To Text

    mlx-community/GLM-OCR-8bit

    722
    4
    transformers
    Image To Text

    mradermacher/Qwen3-VL-8B-Abliterated-Caption-it-GGUF

    706
    5
    transformers
    6 / 13