NEWVectors or files. Pick a path.Start →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 4972 of 550 models

    Image Text To Text

    unsloth/gemma-4-E2B-it-GGUF

    1.2M
    236
    Image Text To Text

    nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1

    1.2M
    180
    transformers
    Image Text To Text

    unsloth/Qwen3.6-27B-MTP-GGUF

    1.2M
    688
    transformers
    Image Text To Text

    cyankiwi/Qwen3.5-4B-AWQ-4bit

    1.1M
    14
    transformers
    Image Text To Text

    unsloth/Qwen3.5-9B-GGUF

    1.1M
    663
    transformers
    Image Text To Text

    cyankiwi/gemma-4-31B-it-AWQ-4bit

    1.1M
    48
    transformers
    Image Text To Text

    Qwen/Qwen3.5-397B-A17B

    1.1M
    1,504
    transformers
    Image Text To Text

    unsloth/gemma-4-E4B-it-GGUF

    1.1M
    483
    Image Text To Text

    cyankiwi/Qwen3.5-9B-AWQ-4bit

    1.0M
    30
    transformers
    Image Text To Text

    Qwen/Qwen2.5-VL-7B-Instruct-AWQ

    1.0M
    105
    transformers
    Image Text To Text

    unsloth/Qwen3.6-35B-A3B-MTP-GGUF

    983K
    470
    transformers
    Image Text To Text

    Qwen/Qwen3.5-397B-A17B-FP8

    963K
    175
    transformers
    Image Text To Text

    LGAI-EXAONE/EXAONE-4.5-33B

    945K
    161
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-30B-A3B-Instruct

    941K
    577
    transformers
    Image Text To Text

    OpenGVLab/InternVL2-1B

    916K
    81
    transformers
    Image Text To Text

    llava-hf/llava-onevision-qwen2-0.5b-ov-hf

    909K
    55
    transformers
    Image Text To Text

    HuggingFaceTB/SmolVLM-256M-Instruct

    890K
    364
    transformers
    Image Text To Text

    deepseek-ai/deepseek-vl2-tiny

    843K
    248
    transformers
    Image Text To Text

    AxionML/Qwen3.5-9B-NVFP4

    829K
    17
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-8B-Instruct-FP8

    821K
    71
    transformers
    Image Text To Text

    Qwen/Qwen3.5-122B-A10B

    816K
    564
    transformers
    Image Text To Text

    unsloth/Qwen3.5-4B-GGUF

    786K
    268
    transformers
    Image Text To Text

    Qwen/Qwen3.5-35B-A3B-GPTQ-Int4

    775K
    87
    transformers
    Image Text To Text

    LiquidAI/LFM2.5-VL-450M

    769K
    183
    transformers
    3 / 23