NEWAgents can now see video via MCP.Try it now →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    202 models available

    Showing 145168 of 202 models

    Visual Question Answering

    MariaK/vilt_finetuned_100

    26
    transformers
    Visual Question Answering

    google/pix2struct-ai2d-large

    25
    4
    transformers
    Visual Question Answering

    TIGER-Lab/VL-Reasoner-72B

    25
    3
    transformers
    Visual Question Answering

    Datadog/Toto-1.0-QA-Experimental

    25
    Visual Question Answering

    erax-ai/EraX-VL-7B-V2.0-Preview

    25
    27
    transformers
    Visual Question Answering

    ZGZzz/SAME

    24
    same
    Visual Question Answering

    ivelin/donut-refexp-combined-v1

    23
    4
    transformers
    Visual Question Answering

    PhelixZhen/Algea-VE

    23
    transformers
    Visual Question Answering

    Nhaass/Qwen3-VL-2B-ChartQA

    23
    2
    transformers
    Visual Question Answering

    SwordElucidator/MiniCPM-Llama3-V-2_5

    22
    transformers
    Visual Question Answering

    Ngoac/EraX-VL-2B-V1.5-Q4_K_M-GGUF

    22
    transformers
    Visual Question Answering

    omlab/VLM-R1-Qwen2.5VL-3B-Math-0305

    22
    8
    Visual Question Answering

    internlm/internlm-xcomposer2d5-ol-7b

    21
    50
    Visual Question Answering

    Cran-May/Shi-Ci-Vision

    21
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base

    21
    1
    transformers
    Visual Question Answering

    MrDevolver/Skywork-R1V3-38B-Q2_K-GGUF

    21
    1
    transformers
    Visual Question Answering

    unum-cloud/uform-gen-chat

    20
    18
    transformers
    Visual Question Answering

    amitha/mllava-baichuan2-en

    20
    transformers
    Visual Question Answering

    Yosemat/designvlm

    20
    1
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-72B

    20
    10
    transformers
    Visual Question Answering

    LeroyDyer/_Spydaz_Web_AI_LlavaNext

    20
    1
    transformers
    Visual Question Answering

    Pankaj121212/blip-2-fine-tuned

    20
    transformers
    Visual Question Answering

    introvoyz041/Ministral-3B-MedVL-Q8_0-GGUF

    20
    Visual Question Answering

    GeorgyGUF/INFRL-Qwen2.5-VL-72B-Preview-q8-with-bf16-output-and-bf16-embedding.gguf

    20
    transformers
    7 / 9