
    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 217-240 of 550 models

    Image Text To Text

    hamishivi/Qwen3.5-9B

    160K
    transformers
    Image Text To Text

    unsloth/Qwen3.5-9B

    157K
    20
    transformers
    Image Text To Text

    Qwen/Qwen2.5-VL-32B-Instruct-AWQ

    156K
    62
    transformers
    Image Text To Text

    meta-llama/Llama-Guard-4-12B

    153K
    101
    transformers
    Image Text To Text

    numind/NuExtract-2.0-8B-GPTQ

    152K
    6
    transformers
    Image Text To Text

    OpenGVLab/InternVL3-1B

    152K
    85
    transformers
    Image Text To Text

    HuggingFaceTB/SmolVLM2-256M-Video-Instruct

    152K
    103
    transformers
    Image Text To Text

    OpenGVLab/InternVL2_5-8B-AWQ

    151K
    8
    transformers
    Image Text To Text

    bartowski/Qwen_Qwen3.6-35B-A3B-GGUF

    151K
    111
    Image Text To Text

    OpenGVLab/InternVL3_5-1B-Instruct

    151K
    7
    transformers
    Image Text To Text

    Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

    150K
    2,873
    Image Text To Text

    unsloth/Qwen3.5-4B

    149K
    24
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-AWQ

    148K
    14
    transformers
    Image Text To Text

    allenai/Molmo2-O-7B

    147K
    26
    transformers
    Image Text To Text

    bartowski/google_gemma-4-26B-A4B-it-GGUF

    147K
    130
    Image Text To Text

    stepfun-ai/Step-3.7-Flash-NVFP4

    146K
    44
    transformers
    Image Text To Text

    lmstudio-community/Qwen3-VL-4B-Instruct-MLX-4bit

    145K
    7
    mlx
    Image Text To Text

    lmms-lab/LLaVA-OneVision-1.5-8B-Instruct

    145K
    62
    transformers
    Image Text To Text

    nvidia/NVLM-D-72B

    142K
    776
    transformers
    Image Text To Text

    lmstudio-community/Qwen3-VL-4B-Instruct-MLX-8bit

    141K
    1
    mlx
    Image Text To Text

    lmstudio-community/Qwen3-VL-4B-Instruct-MLX-5bit

    141K
    mlx
    Image Text To Text

    lmstudio-community/Qwen3-VL-4B-Instruct-MLX-6bit

    140K
    mlx
    Image Text To Text

    QuantTrio/Qwen3.5-35B-A3B-AWQ

    140K
    18
    transformers
    Image Text To Text

    unsloth/gemma-4-E2B-it-unsloth-bnb-4bit

    139K
    8