NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 97120 of 400 models

    Image Text To Text

    unsloth/Qwen3.5-2B-GGUF

    316K
    97
    transformers
    Image Text To Text

    lmstudio-community/gemma-3n-E4B-it-MLX-6bit

    316K
    transformers
    Image Text To Text

    Qwen/Qwen3.5-122B-A10B-GPTQ-Int4

    311K
    38
    transformers
    Image Text To Text

    baidu/Qianfan-OCR

    309K
    1,160
    transformers
    Image Text To Text

    bartowski/Qwen_Qwen3.5-35B-A3B-GGUF

    308K
    63
    Image Text To Text

    lmstudio-community/gemma-4-31B-it-MLX-8bit

    308K
    1
    transformers
    Image Text To Text

    HauhauCS/Gemma-4-E2B-Uncensored-HauhauCS-Aggressive

    307K
    138
    Image Text To Text

    QuantTrio/Qwen3.5-35B-A3B-AWQ

    304K
    18
    transformers
    Image Text To Text

    reducto/RolmOCR

    304K
    585
    transformers
    Image Text To Text

    bartowski/google_gemma-4-31B-it-GGUF

    301K
    54
    Image Text To Text

    OpenGVLab/InternVL2_5-8B

    298K
    104
    transformers
    Image Text To Text

    lmstudio-community/gemma-3-4b-it-GGUF

    298K
    27
    Image Text To Text

    Jackrong/Qwopus3.5-9B-v3

    297K
    83
    Image Text To Text

    google/gemma-3n-E2B-it

    297K
    296
    transformers
    Image Text To Text

    unsloth/Qwen3.5-0.8B-GGUF

    296K
    144
    transformers
    Image Text To Text

    moonshotai/Kimi-K2.6

    292K
    1,007
    transformers
    Image Text To Text

    lovedheart/Qwen3.5-9B-FP8

    285K
    10
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-9B-AWQ

    283K
    12
    transformers
    Image Text To Text

    lmstudio-community/gemma-4-31B-it-MLX-4bit

    280K
    transformers
    Image Text To Text

    bartowski/google_gemma-4-26B-A4B-it-GGUF

    278K
    109
    Image Text To Text

    Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

    274K
    106
    transformers
    Image Text To Text

    stepfun-ai/Step3-VL-10B

    273K
    405
    Image Text To Text

    lmstudio-community/gemma-4-31B-it-MLX-6bit

    271K
    transformers
    Image Text To Text

    mlabonne/gemma-3-27b-it-abliterated

    271K
    315
    transformers
    5 / 17