NEWAgents can now see video via MCP.Try it now →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    400 models available

    Showing 313336 of 400 models

    Image Text To Text

    kristaller486/dots.ocr-1.5

    62K
    22
    dots_ocr_1_5
    Image Text To Text

    unsloth/Qwen3.6-35B-A3B-UD-MLX-4bit

    62K
    43
    mlx
    Image Text To Text

    unsloth/Qwen3-VL-4B-Instruct-unsloth-bnb-4bit

    61K
    11
    Image Text To Text

    Qwen/Qwen3.5-35B-A3B-Base

    60K
    129
    transformers
    Image Text To Text

    LiquidAI/LFM2.5-VL-450M-GGUF

    60K
    42
    Image Text To Text

    Qwen/Qwen3-VL-30B-A3B-Thinking

    60K
    197
    transformers
    Image Text To Text

    Qwen/Qwen3.5-2B-Base

    59K
    67
    transformers
    Image Text To Text

    llava-hf/llava-onevision-qwen2-7b-ov-hf

    58K
    38
    transformers
    Image Text To Text

    zai-org/GLM-4.5V

    58K
    718
    transformers
    Image Text To Text

    CohereLabs/command-a-vision-07-2025

    58K
    88
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

    57K
    43
    transformers
    Image Text To Text

    bartowski/Qwen_Qwen3.5-27B-GGUF

    57K
    69
    Image Text To Text

    google/gemma-3-27b-it-qat-q4_0-unquantized

    57K
    41
    transformers
    Image Text To Text

    coder3101/gemma-4-26B-A4B-it-heretic

    57K
    78
    transformers
    Image Text To Text

    unsloth/gemma-4-31B-it

    57K
    14
    Image Text To Text

    mlx-community/Qwen3.6-35B-A3B-4bit

    56K
    23
    mlx
    Image Text To Text

    PerceptronAI/Isaac-0.2-2B-Preview

    55K
    11
    Image Text To Text

    florence-community/Florence-2-large

    55K
    5
    transformers
    Image Text To Text

    MBZUAI/AIN

    54K
    17
    Image Text To Text

    AIDC-AI/Ovis2-1B

    54K
    96
    transformers
    Image Text To Text

    LiquidAI/LFM2-VL-3B-GGUF

    54K
    36
    Image Text To Text

    lmstudio-community/Qwen3-VL-30B-A3B-Instruct-MLX-4bit

    53K
    1
    mlx
    Image Text To Text

    TIGER-Lab/Mantis-8B-siglip-llama3

    53K
    33
    transformers
    Image Text To Text

    OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview

    53K
    82
    transformers
    14 / 17