
    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    402 models available

    Showing 73-96 of 402 models

    Image To Text

    google/pix2struct-textcaps-base

    5K
    29
    transformers
    Image To Text

    microsoft/trocr-base-str

    5K
    6
    transformers
    Image To Text

    Xenova/vit-gpt2-image-captioning

    4K
    29
    transformers.js
    Image To Text

    KuroTo4ka/Qwen3-VL-8B-Instruct-Unredacted-MAX-Quants-GGUF

    4K
    6
    gguf
    Image To Text

    breezedeus/pix2text-mfr

    4K
    56
    transformers
    Image To Text

    microsoft/git-large-coco

    4K
    105
    transformers
    Image To Text

    PaddlePaddle/devanagari_PP-OCRv5_mobile_rec

    4K
    PaddleOCR
    Image To Text

    xtuner/llava-llama-3-8b-v1_1-gguf

    4K
    225
    Image To Text

    PaddlePaddle/PP-DocLayout-L

    4K
    5
    PaddleOCR
    Image To Text

    unsloth/GLM-OCR

    4K
    33
    transformers
    Image To Text

    PaddlePaddle/arabic_PP-OCRv5_mobile_rec

    4K
    4
    PaddleOCR
    Image To Text

    google/pix2struct-base

    4K
    79
    transformers
    Image To Text

    PaddlePaddle/en_PP-OCRv3_mobile_rec

    4K
    1
    PaddleOCR
    Image To Text

    nvidia/nemotron-ocr-v1

    4K
    119
    Image To Text

    logasanjeev/indian-id-validator

    3K
    5
    ultralytics
    Image To Text

    fxmarty/pix2struct-tiny-random

    3K
    2
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv4_server_det

    3K
    2
    PaddleOCR
    Image To Text

    PaddlePaddle/SLANeXt_wireless

    3K
    1
    PaddleOCR
    Image To Text

    mradermacher/HunyuanOCR-GGUF

    3K
    transformers
    Image To Text

    Norm/nougat-latex-base

    3K
    82
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv4_server_rec_doc

    3K
    2
    PaddleOCR
    Image To Text

    mlx-community/GLM-OCR-bf16

    3K
    29
    transformers
    Image To Text

    PaddlePaddle/arabic_PP-OCRv3_mobile_rec

    3K
    3
    PaddleOCR
    Image To Text

    PaddlePaddle/th_PP-OCRv5_mobile_rec

    3K
    2
    PaddleOCR
    Page 4 of 17