NEWAgents can now see video via MCP.Try it now →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    202 models available

    Showing 124 of 202 models

    Visual Question Answering

    Salesforce/blip-vqa-base

    188K
    190
    transformers
    Visual Question Answering

    dandelin/vilt-b32-finetuned-vqa

    177K
    421
    transformers
    Visual Question Answering

    openbmb/MiniCPM-V-2

    58K
    495
    transformers
    Visual Question Answering

    google/deplot

    23K
    316
    transformers
    Visual Question Answering

    Salesforce/blip-vqa-capfilt-large

    18K
    53
    transformers
    Visual Question Answering

    Lin-Chen/sharegpt4video-8b

    13K
    45
    transformers
    Visual Question Answering

    chaoyinshe/llava-med-v1.5-mistral-7b-hf

    4K
    6
    Visual Question Answering

    google/pix2struct-docvqa-base

    2K
    44
    transformers
    Visual Question Answering

    TIGER-Lab/VideoScore2

    2K
    3
    Visual Question Answering

    openbmb/MiniCPM-V

    2K
    201
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-vl-7b

    2K
    84
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-4khd-7b

    1K
    73
    transformers
    Visual Question Answering

    google/pix2struct-ai2d-base

    1K
    43
    transformers
    Visual Question Answering

    ricoh-ai/Qwen-3-VL-Ricoh-8B-20260227

    1K
    13
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2.1-7B-AV

    1K
    16
    transformers
    Visual Question Answering

    second-state/MiniCPM-V-4_5-GGUF

    641
    14
    Visual Question Answering

    LZXzju/Qwen2.5-VL-3B-UI-R1-E

    637
    5
    Visual Question Answering

    microsoft/git-base-textvqa

    619
    7
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2d5-7b

    535
    210
    transformers
    Visual Question Answering

    prithivMLmods/OpenMed-SynthVision-MedVL-AIO-GGUF

    480
    3
    transformers
    Visual Question Answering

    google/matcha-base

    418
    29
    transformers
    Visual Question Answering

    TIGER-Lab/VideoScore

    409
    7
    transformers
    Visual Question Answering

    second-state/MiniCPM-V-2_6-GGUF

    403
    5
    Visual Question Answering

    Swicked86/phi4-mm-gguf

    402
    2
    gguf
    1 / 9