NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 7396 of 400 models

    Image Text To Text

    google/gemma-3-27b-it

    484K
    1,956
    transformers
    Image Text To Text

    llava-hf/llava-onevision-qwen2-0.5b-ov-hf

    483K
    55
    transformers
    Image Text To Text

    cyankiwi/Qwen3.5-35B-A3B-AWQ-4bit

    479K
    40
    transformers
    Image Text To Text

    OpenGVLab/InternVL2-8B

    476K
    187
    transformers
    Image Text To Text

    Salesforce/blip2-opt-2.7b

    464K
    439
    transformers
    Image Text To Text

    unsloth/Qwen3.6-27B-GGUF

    458K
    395
    transformers
    Image Text To Text

    Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled

    451K
    56
    Image Text To Text

    Qwen/Qwen3-VL-4B-Thinking

    444K
    109
    transformers
    Image Text To Text

    HuggingFaceTB/SmolVLM2-500M-Video-Instruct

    423K
    133
    transformers
    Image Text To Text

    HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive

    419K
    425
    Image Text To Text

    Qwen/Qwen3-VL-8B-Instruct-FP8

    418K
    68
    transformers
    Image Text To Text

    meta-llama/Llama-4-Scout-17B-16E-Instruct

    410K
    1,276
    transformers
    Image Text To Text

    google/gemma-4-31B

    397K
    331
    transformers
    Image Text To Text

    Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF

    397K
    598
    Image Text To Text

    Qwen/Qwen3-VL-8B-Thinking

    396K
    203
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-27B-AWQ

    392K
    43
    transformers
    Image Text To Text

    zai-org/GLM-4.1V-9B-Thinking

    372K
    777
    transformers
    Image Text To Text

    Qwen/Qwen3.5-27B-GPTQ-Int4

    371K
    51
    transformers
    Image Text To Text

    Qwen/Qwen3.6-27B-FP8

    347K
    140
    transformers
    Image Text To Text

    unsloth/gemma-4-31B-it-unsloth-bnb-4bit

    341K
    11
    Image Text To Text

    ISTA-DASLab/gemma-3-27b-it-GPTQ-4b-128g

    337K
    44
    transformers
    Image Text To Text

    lmstudio-community/gemma-3n-E4B-it-MLX-4bit

    336K
    2
    transformers
    Image Text To Text

    lmstudio-community/gemma-3n-E4B-it-MLX-bf16

    320K
    3
    transformers
    Image Text To Text

    lmstudio-community/gemma-3n-E4B-it-MLX-8bit

    319K
    transformers
    4 / 17