    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 529–550 of 550 models


    Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int4

    42K
    38
    transformers
    Image Text To Text

    google/gemma-3n-E4B-it

    42K
    910
    transformers
    Image Text To Text

    meta-llama/Llama-4-Maverick-17B-128E-Instruct

    42K
    480
    transformers
    Image Text To Text

    zai-org/GLM-4.6V-Flash

    42K
    600
    transformers
    Image Text To Text

    openbmb/MiniCPM-Llama3-V-2_5

    41K
    1,411
    transformers
    Image Text To Text

    OpenGVLab/InternVL3-78B

    41K
    234
    transformers
    Image Text To Text

    winninghealth/olmOCR-2-7B-1025-INT4

    41K
    transformers
    Image Text To Text

    gokaygokay/Florence-2-SD3-Captioner

    41K
    41
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-2B-Instruct-unsloth-bnb-4bit

    40K
    7
    transformers
    Image Text To Text

    Momix-44/gemma-4-31B-it-heretic-v2

    40K
    4
    transformers
    Image Text To Text

    gokaygokay/Florence-2-Flux

    40K
    14
    transformers
    Image Text To Text

    mlx-community/Qwen3.5-35B-A3B-4bit

    40K
    36
    transformers
    Image Text To Text

    mlx-community/gemma-4-31b-8bit

    39K
    20
    mlx
    Image Text To Text

    internlm/Intern-S1

    39K
    258
    transformers
    Image Text To Text

    OpenGVLab/InternVL2_5-4B-AWQ

    39K
    7
    transformers
    Image Text To Text

    AIDC-AI/Ovis2.6-30B-A3B

    38K
    143
    Image Text To Text

    docling-project/SmolDocling-256M-preview

    38K
    1,614
    transformers
    Image Text To Text

    OpenGVLab/InternVL3-8B-hf

    38K
    9
    transformers
    Image Text To Text

    apolo13x/Qwen3.5-35B-A3B-NVFP4

    38K
    15
    transformers
    Image Text To Text

    Jackrong/Qwopus3.5-4B-v3-GGUF

    37K
    41
    Image Text To Text

    llava-hf/llama3-llava-next-8b-hf

    37K
    51
    transformers
    Image Text To Text

    unsloth/Qwen3-VL-8B-Instruct-unsloth-bnb-4bit

    37K
    19