
    Image To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 49-72 of 300 models

    Image To Text

    Riksarkivet/trocr-base-handwritten-hist-swe-2

    11K
    12
    htrflow
    Image To Text

    PaddlePaddle/PP-DocBlockLayout

    11K
    3
    PaddleOCR
    Image To Text

    PaddlePaddle/korean_PP-OCRv5_mobile_rec

    11K
    13
    PaddleOCR
    Image To Text

    microsoft/trocr-base-stage1

    10K
    17
    transformers
    Image To Text

    alephpi/FormulaNet

    10K
    2
    Image To Text

    PaddlePaddle/SLANeXt_wired

    10K
    1
    PaddleOCR
    Image To Text

    PaddlePaddle/SLANet_plus

    8K
    PaddleOCR
    Image To Text

    google/pix2struct-textcaps-base

    8K
    29
    transformers
    Image To Text

    logasanjeev/indian-id-validator

    8K
    5
    ultralytics
    Image To Text

    PaddlePaddle/en_PP-OCRv4_mobile_rec

    7K
    3
    PaddleOCR
    Image To Text

    PaddlePaddle/PP-FormulaNet_plus-L

    7K
    3
    PaddleOCR
    Image To Text

    PaddlePaddle/eslav_PP-OCRv5_mobile_rec

    6K
    1
    PaddleOCR
    Image To Text

    noctrex/LightOnOCR-2-1B-ocr-soup-GGUF

    6K
    7
    Image To Text

    mradermacher/Qwen2.5-VL-7B-Abliterated-Caption-it-GGUF

    6K
    70
    transformers
    Image To Text

    PaddlePaddle/PP-Chart2Table

    6K
    3
    PaddleOCR
    Image To Text

    microsoft/git-large-coco

    6K
    105
    transformers
    Image To Text

    PaddlePaddle/PP-OCRv4_mobile_rec

    6K
    2
    PaddleOCR
    Image To Text

    ydshieh/vit-gpt2-coco-en

    6K
    39
    transformers
    Image To Text

    lolzinventor/Qwen3.5-4B-Base-ZitGen-V1

    6K
    13
    Image To Text

    mradermacher/Hulu-Med-Flash-Preview-27B-i1-GGUF

    5K
    1
    transformers
    Image To Text

    Xenova/vit-gpt2-image-captioning

    5K
    28
    transformers.js
    Image To Text

    microsoft/git-base-coco

    4K
    21
    transformers
    Image To Text

    noctrex/LightOnOCR-2-1B-GGUF

    4K
    27
    Image To Text

    Norm/nougat-latex-base

    4K
    82
    transformers