NEWVectors or files. Pick a path.Start →

    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 481504 of 550 models

    Image Text To Text

    OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview-HF

    53K
    9
    transformers
    Image Text To Text

    AIDC-AI/Ovis2-4B

    53K
    63
    transformers
    Image Text To Text

    Vishva007/gemma-4-E4B-it-W4A16-AutoRound-GPTQ

    53K
    4
    transformers
    Image Text To Text

    OpenGVLab/InternVL3_5-4B

    53K
    24
    transformers
    Image Text To Text

    llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-GGUF

    53K
    77
    transformers
    Image Text To Text

    Qwen/Qwen2-VL-2B-Instruct-AWQ

    53K
    24
    Image Text To Text

    HauhauCS/Qwen3.5-122B-A10B-Uncensored-HauhauCS-Aggressive

    53K
    117
    Image Text To Text

    coder3101/gemma-4-26B-A4B-it-heretic

    52K
    84
    transformers
    Image Text To Text

    mlx-community/gemma-3-27b-it-qat-4bit

    52K
    23
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-4B-Instruct-unsloth-bnb-4bit

    52K
    11
    Image Text To Text

    typhoon-ai/typhoon-ocr1.5-2b

    52K
    20
    transformers
    Image Text To Text

    LiquidAI/LFM2-VL-3B-GGUF

    52K
    36
    Image Text To Text

    llmfan46/Qwen3.6-35B-A3B-uncensored-heretic

    51K
    83
    transformers
    Image Text To Text

    unsloth/gemma-3-4b-it-GGUF

    51K
    188
    transformers
    Image Text To Text

    mlx-community/Qwen3.5-9B-MLX-4bit

    51K
    119
    mlx
    Image Text To Text

    PaddlePaddle/PaddleOCR-VL-1.5

    51K
    613
    PaddleOCR
    Image Text To Text

    allenai/Molmo2-4B

    51K
    48
    transformers
    Image Text To Text

    MBZUAI/AIN

    50K
    17
    Image Text To Text

    Qwen/Qwen3-VL-30B-A3B-Thinking

    50K
    198
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-8B-Instruct-GGUF

    50K
    43
    transformers
    Image Text To Text

    google/t5gemma-2-1b-1b

    50K
    77
    transformers
    Image Text To Text

    unsloth/Qwen3.6-27B-MLX-8bit

    49K
    28
    mlx
    Image Text To Text

    Qwen/Qwen3.5-397B-A17B-GPTQ-Int4

    49K
    28
    transformers
    Image Text To Text

    unsloth/gemma-4-31B-it

    49K
    17
    21 / 23