
    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines, and plug any model into your extractors.

    550 models available

    Showing 409–432 of 550 models

    Image Text To Text

    optimum-intel-internal-testing/tiny-random-llava-next-mistral

    69K
    transformers
    Image Text To Text

    microsoft/Phi-4-reasoning-vision-15B

    68K
    171
    Image Text To Text

    lmstudio-community/gemma-4-31B-it-MLX-4bit

    68K
    1
    transformers
    Image Text To Text

    kristaller486/dots.ocr-1.5

    67K
    23
    dots_ocr_1_5
    Image Text To Text

    Qwen/Qwen3-VL-32B-Thinking

    66K
    87
    transformers
    Image Text To Text

    openvla/openvla-7b-finetuned-libero-object

    66K
    1
    transformers
    Image Text To Text

    huihui-ai/Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-MTP-GGUF

    66K
    39
    transformers
    Image Text To Text

    unsloth/Qwen3.6-27B

    66K
    27
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-4B-AWQ

    65K
    8
    transformers
    Image Text To Text

    Qwen/Qwen3-VL-8B-Thinking-FP8

    65K
    32
    transformers
    Image Text To Text

    havenoammo/Qwen3.6-27B-MTP-UD-GGUF

    65K
    100
    transformers
    Image Text To Text

    Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF

    65K
    607
    Image Text To Text

    bartowski/Qwen_Qwen3.5-397B-A17B-GGUF

    65K
    10
    Image Text To Text

    OpenGVLab/InternVL3_5-14B

    64K
    30
    transformers
    Image Text To Text

    yujiepan/ui-tars-1.5-7B-GPTQ-W4A16g128

    64K
    1
    Image Text To Text

    unsloth/Qwen3.5-35B-A3B

    64K
    15
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-8B-Instruct

    64K
    9
    transformers
    Image Text To Text

    ggml-org/gemma-3-4b-it-GGUF

    64K
    54
    Image Text To Text

    unsloth/Qwen3.6-35B-A3B

    63K
    24
    transformers
    Image Text To Text

    DavidAU/Qwen3.5-9B-Claude-4.6-OS-Auto-Variable-HERETIC-UNCENSORED-THINKING-MAX-NEOCODE-Imatrix-GGUF

    63K
    101
    transformers
    Image Text To Text

    Chunity/gemma-4-E4B-it-AWQ-4bit

    63K
    3
    transformers
    Image Text To Text

    stelterlab/Mistral-Small-3.2-24B-Instruct-2506-FP8

    63K
    8
    vllm
    Image Text To Text

    dengcao/GLM-4.1V-9B-Thinking-AWQ

    63K
    1
    transformers
    Image Text To Text

    LiquidAI/LFM2.5-VL-450M-GGUF

    63K
    43
    18 / 23