    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 121-144 of 550 models

    Image Text To Text

    tencent/HunyuanOCR

    363K
    756
    transformers
    Image Text To Text

    RedHatAI/gemma-3-27b-it-quantized.w4a16

    361K
    13
    transformers
    Image Text To Text

    google/gemma-3-4b-pt

    353K
    155
    transformers
    Image Text To Text

    Jackrong/Qwopus3.6-27B-v1-preview-GGUF

    349K
    124
    transformers
    Image Text To Text

    kakaocorp/kanana-1.5-v-3b-instruct

    344K
    55
    transformers
    Image Text To Text

    unsloth/Qwen3.5-0.8B-GGUF

    341K
    173
    transformers
    Image Text To Text

    cyankiwi/Qwen3.5-397B-A17B-AWQ-4bit

    340K
    2
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-32B-Instruct-FP8

    334K
    46
    transformers
    Image Text To Text

    lightonai/LightOnOCR-2-1B

    334K
    692
    transformers
    Image Text To Text

    OpenGVLab/InternVL3-1B-hf

    331K
    10
    transformers
    Image Text To Text

    Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2

    317K
    122
    Image Text To Text

    Qwen/Qwen3-VL-32B-Thinking-FP8

    313K
    26
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

    310K
    44
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-8B-Thinking

    310K
    210
    transformers
    Image Text To Text

    nvidia/Cosmos-Reason2-8B

    309K
    191
    cosmos
    Image Text To Text

    nvidia/Cosmos-Reason2-2B

    304K
    99
    cosmos
    Image Text To Text

    moonshotai/Kimi-VL-A3B-Instruct

    298K
    267
    transformers
    Image Text To Text

    huihui-ai/Huihui-Qwen3.5-27B-Claude-4.6-Opus-abliterated

    296K
    120
    transformers
    Image Text To Text

    moondream/moondream3-preview

    296K
    650
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-27B-AWQ

    294K
    43
    transformers
    Image Text To Text

    HuggingFaceTB/SmolVLM2-2.2B-Instruct

    293K
    319
    transformers
    Image Text To Text

    RedHatAI/gemma-4-31B-it-FP8-block

    290K
    30
    transformers
    Image Text To Text

    unsloth/Kimi-K2.6-GGUF

    287K
    159
    transformers
    Image Text To Text

    nanonets/Nanonets-OCR2-3B

    278K
    507
    transformers
    Page 6 of 23