NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 169192 of 400 models

    Image Text To Text

    Qwen/Qwen2.5-VL-72B-Instruct-AWQ

    168K
    71
    transformers
    Image Text To Text

    bartowski/Qwen_Qwen3.5-2B-GGUF

    166K
    14
    Image Text To Text

    Qwen/Qwen2.5-VL-32B-Instruct-AWQ

    166K
    61
    transformers
    Image Text To Text

    Jackrong/Qwopus3.5-9B-v3-GGUF

    165K
    320
    Image Text To Text

    huihui-ai/Huihui-Qwen3.5-35B-A3B-Claude-4.6-Opus-abliterated

    163K
    41
    transformers
    Image Text To Text

    stepfun-ai/step3

    161K
    166
    transformers
    Image Text To Text

    cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit

    161K
    38
    transformers
    Image Text To Text

    google/gemma-3-4b-pt

    161K
    154
    transformers
    Image Text To Text

    unsloth/gemma-4-E2B-it

    160K
    10
    Image Text To Text

    cyankiwi/Qwen3.5-27B-AWQ-4bit

    153K
    38
    transformers
    Image Text To Text

    google/medgemma-27b-it

    152K
    344
    transformers
    Image Text To Text

    RedHatAI/gemma-4-31B-it-FP8-block

    146K
    17
    transformers
    Image Text To Text

    bartowski/google_gemma-4-E2B-it-GGUF

    146K
    26
    Image Text To Text

    internlm/Intern-S1-Pro

    146K
    276
    transformers
    Image Text To Text

    HuggingFaceTB/SmolVLM2-2.2B-Instruct

    144K
    314
    transformers
    Image Text To Text

    janhq/Jan-v2-VL-high-gguf

    142K
    37
    transformers
    Image Text To Text

    QuantTrio/gemma-4-31B-it-AWQ

    139K
    10
    transformers
    Image Text To Text

    unsloth/gemma-3-12b-it-GGUF

    139K
    183
    transformers
    Image Text To Text

    Qwen/Qwen3.5-0.8B-Base

    139K
    70
    transformers
    Image Text To Text

    openbmb/MiniCPM-V-2_6

    137K
    1,038
    transformers
    Image Text To Text

    INSAIT-Institute/BgGPT-Gemma-3-12B-IT

    136K
    4
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-4B-Instruct

    135K
    10
    transformers
    Image Text To Text

    xlangai/OpenCUA-7B

    135K
    29
    transformers
    Image Text To Text

    meta-llama/Llama-3.2-11B-Vision-Instruct

    135K
    1,587
    transformers
    8 / 17