NEWAgents can now see video via MCP.Try it now →

    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 289300 of 300 models

    Image To Text

    PaddlePaddle/UVDoc_safetensors

    160
    1
    PaddleOCR
    Image To Text

    TIGER-Lab/RationalRewards-8B-T2I

    160
    4
    transformers
    Image To Text

    Xenova/trocr-base-handwritten

    159
    4
    transformers.js
    Image To Text

    noctrex/Chandra-OCR-i1-GGUF

    158
    1
    Image To Text

    YouLiXiya/tinyllava-v1.0-1.1b-hf

    157
    4
    transformers
    Image To Text

    manu02/LAnA-Arxiv

    157
    transformers
    Image To Text

    PaddlePaddle/ka_PP-OCRv3_mobile_rec

    156
    PaddleOCR
    Image To Text

    BabaK07/textract-ai

    155
    1
    transformers
    Image To Text

    Flova/omr_transformer

    152
    12
    transformers
    Image To Text

    PaddlePaddle/PP-DocLayout-M

    152
    PaddleOCR
    Image To Text

    microsoft/git-large-r-coco

    151
    11
    transformers
    Image To Text

    mlx-community/GLM-OCR-5bit

    150
    transformers
    13 / 13