NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 2548 of 400 models

    Image Text To Text

    unsloth/gemma-4-E4B-it-GGUF

    1.6M
    331
    Image Text To Text

    Qwen/Qwen3.5-35B-A3B-FP8

    1.6M
    146
    transformers
    Image Text To Text

    opendatalab/MinerU2.5-2509-1.2B

    1.6M
    355
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-32B-Instruct

    1.6M
    197
    transformers
    Image Text To Text

    microsoft/Phi-3.5-vision-instruct

    1.5M
    732
    transformers
    Image Text To Text

    deepseek-ai/DeepSeek-OCR-2

    1.5M
    926
    transformers
    Image Text To Text

    unsloth/Qwen3.6-35B-A3B-GGUF

    1.5M
    754
    transformers
    Image Text To Text

    HauhauCS/Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive

    1.4M
    1,364
    Image Text To Text

    Qwen/Qwen3.5-397B-A17B-FP8

    1.3M
    163
    transformers
    Image Text To Text

    unsloth/Qwen3.5-35B-A3B-GGUF

    1.2M
    832
    Image Text To Text

    Qwen/Qwen3-VL-235B-A22B-Instruct

    1.2M
    383
    transformers
    Image Text To Text

    Qwen/Qwen3.5-27B-FP8

    1.1M
    130
    transformers
    Image Text To Text

    OpenGVLab/InternVL2-2B

    1.1M
    80
    transformers
    Image Text To Text

    Qwen/Qwen3.5-122B-A10B

    1.1M
    527
    transformers
    Image Text To Text

    Qwen/Qwen3.6-35B-A3B

    1.0M
    1,393
    transformers
    Image Text To Text

    Qwen/Qwen3.6-35B-A3B-FP8

    1.0M
    174
    transformers
    Image Text To Text

    microsoft/Florence-2-large

    994K
    1,800
    transformers
    Image Text To Text

    nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1

    960K
    177
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-30B-A3B-Instruct

    958K
    565
    transformers
    Image Text To Text

    unsloth/Qwen3.5-9B-GGUF

    939K
    549
    transformers
    Image Text To Text

    HauhauCS/Gemma-4-E4B-Uncensored-HauhauCS-Aggressive

    938K
    476
    Image Text To Text

    cyankiwi/gemma-4-26B-A4B-it-AWQ-4bit

    935K
    48
    transformers
    Image Text To Text

    unsloth/gemma-4-E2B-it-GGUF

    929K
    157
    Image Text To Text

    Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF

    893K
    639
    2 / 17