
    Image Text To Text Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    550 models available

    Showing 97–120 of 550 models


    OpenGVLab/InternVL3-8B-AWQ

    555K
    8
    transformers
    Image Text To Text

    ibm-granite/granite-docling-258M

    546K
    1,187
    transformers
    Image Text To Text

    stepfun-ai/Step3-VL-10B

    532K
    406
    Image Text To Text

    HauhauCS/Qwen3.6-27B-Uncensored-HauhauCS-Aggressive

    524K
    423
    Image Text To Text

    google/medgemma-27b-it

    519K
    360
    transformers
    Image Text To Text

    QuantTrio/Qwen3.5-9B-AWQ

    506K
    19
    transformers
    Image Text To Text

    zai-org/GLM-4.1V-9B-Thinking

    503K
    776
    transformers
    Image Text To Text

    Jackrong/Qwopus3.6-35B-A3B-v1-GGUF

    487K
    191
    transformers
    Image Text To Text

    Qwen/Qwen2.5-VL-72B-Instruct

    471K
    625
    transformers
    Image Text To Text

    google/medgemma-4b-it

    464K
    975
    transformers
    Image Text To Text

    meta-llama/Llama-4-Scout-17B-16E-Instruct

    452K
    1,303
    transformers
    Image Text To Text

    trl-internal-testing/tiny-Qwen2_5_VLForConditionalGeneration

    438K
    transformers
    Image Text To Text

    LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Wasserstein-GGUF

    434K
    99
    Image Text To Text

    google/medgemma-1.5-4b-it

    432K
    664
    transformers
    Image Text To Text

    nvidia/NVIDIA-Nemotron-Parse-v1.1

    431K
    169
    transformers
    Image Text To Text

    liuhaotian/llava-v1.5-7b

    427K
    552
    transformers
    Image Text To Text

    Qwen/Qwen2.5-VL-32B-Instruct

    422K
    489
    transformers
    Image Text To Text

    cyankiwi/Qwen3.5-35B-A3B-AWQ-4bit

    416K
    43
    transformers
    Image Text To Text

    HuggingFaceM4/Idefics3-8B-Llama3

    413K
    304
    transformers
    Image Text To Text

    google/gemma-3n-E2B-it

    409K
    303
    transformers
    Image Text To Text

    ISTA-DASLab/gemma-3-27b-it-GPTQ-4b-128g

    391K
    44
    transformers
    Image Text To Text

    Qwen/Qwen3.5-27B-GPTQ-Int4

    387K
    55
    transformers
    Image Text To Text

    OpenGVLab/InternVL2-8B

    384K
    187
    transformers
    Image Text To Text

    unsloth/gemma-4-E4B-it-unsloth-bnb-4bit

    384K
    21