NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 361384 of 400 models

    Image Text To Text

    bartowski/google_gemma-3-4b-it-GGUF

    44K
    36
    Image Text To Text

    openbmb/MiniCPM-Llama3-V-2_5

    44K
    1,411
    transformers
    Image Text To Text

    PerceptronAI/Isaac-0.1

    44K
    115
    transformers
    Image Text To Text

    lmstudio-community/Qwen2.5-VL-7B-Instruct-GGUF

    43K
    7
    Image Text To Text

    google/paligemma2-3b-ft-docci-448

    43K
    13
    transformers
    Image Text To Text

    Lightricks/gemma-3-12b-it-qat-q4_0-unquantized

    43K
    6
    transformers
    Image Text To Text

    OpenGVLab/InternVL3_5-8B

    43K
    97
    transformers
    Image Text To Text

    gokaygokay/Florence-2-SD3-Captioner

    43K
    41
    transformers
    Image Text To Text

    google/gemma-3n-E4B-it

    43K
    907
    transformers
    Image Text To Text

    arsovskidev/Gemma-4-E4B-Claude-4.6-Opus-Reasoning-Distilled

    42K
    11
    transformers
    Image Text To Text

    gokaygokay/Florence-2-Flux

    42K
    14
    transformers
    Image Text To Text

    ByteDance-Seed/UI-TARS-1.5-7B

    41K
    538
    transformers
    Image Text To Text

    llava-hf/llava-v1.6-vicuna-7b-hf

    41K
    30
    transformers
    Image Text To Text

    groxaxo/Huihui-gemma-4-26B-A4B-it-abliterated-GGUF

    41K
    11
    llama.cpp
    Image Text To Text

    QuantTrio/Qwen3.5-4B-AWQ

    41K
    8
    transformers
    Image Text To Text

    internlm/Intern-S1

    41K
    258
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-2B-Instruct-unsloth-bnb-4bit

    40K
    7
    transformers
    Image Text To Text

    Momix-44/gemma-4-31B-it-heretic-v2

    40K
    3
    transformers
    Image Text To Text

    dealignai/Gemma-4-26B-A4B-JANG_4M-CRACK

    40K
    78
    mlx
    Image Text To Text

    AIDC-AI/Ovis1.6-Llama3.2-3B

    40K
    49
    transformers
    Image Text To Text

    Isotr0py/deepseek-vl2-tiny

    40K
    transformers
    Image Text To Text

    AIDC-AI/Ovis1.6-Gemma2-9B

    40K
    273
    transformers
    Image Text To Text

    OpenGVLab/InternVL2_5-4B

    40K
    57
    transformers
    Image Text To Text

    RedHatAI/gemma-3-27b-it-FP8-dynamic

    40K
    13
    transformers
    16 / 17