NEWVectors or files. Pick a path.Start →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    326 models available

    Showing 124 of 326 models

    Visual Question Answering

    Salesforce/blip-vqa-base

    414K
    194
    transformers
    Visual Question Answering

    dandelin/vilt-b32-finetuned-vqa

    60K
    423
    transformers
    Visual Question Answering

    google/deplot

    34K
    317
    transformers
    Visual Question Answering

    Salesforce/blip-vqa-capfilt-large

    19K
    54
    transformers
    Visual Question Answering

    openbmb/MiniCPM-V-2

    13K
    499
    transformers
    Visual Question Answering

    TIGER-Lab/VideoScore2

    10K
    3
    Visual Question Answering

    UII-AI/uAI-NEXUS-MedVLM-1.0a-7B-RL

    9K
    14
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-7B-16F

    7K
    14
    transformers
    Visual Question Answering

    chaoyinshe/llava-med-v1.5-mistral-7b-hf

    3K
    6
    Visual Question Answering

    google/pix2struct-docvqa-base

    3K
    44
    transformers
    Visual Question Answering

    google/matcha-chartqa

    2K
    47
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2.1-7B-AV

    1K
    16
    transformers
    Visual Question Answering

    microsoft/git-large-vqav2

    1K
    21
    transformers
    Visual Question Answering

    microsoft/git-base-vqav2

    1K
    21
    transformers
    Visual Question Answering

    google/pix2struct-ai2d-base

    1K
    43
    transformers
    Visual Question Answering

    BUAADreamer/Yi-VL-6B-hf

    1K
    2
    transformers
    Visual Question Answering

    google/pix2struct-chartqa-base

    1K
    10
    transformers
    Visual Question Answering

    ricoh-ai/Qwen-3-VL-Ricoh-8B-20260227

    1K
    18
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-7B

    849
    40
    transformers
    Visual Question Answering

    google/matcha-base

    842
    29
    transformers
    Visual Question Answering

    second-state/MiniCPM-V-2_6-GGUF

    718
    5
    Visual Question Answering

    prithivMLmods/OpenMed-SynthVision-MedVL-AIO-GGUF

    715
    3
    transformers
    Visual Question Answering

    microsoft/git-base-textvqa

    692
    7
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-vl-7b

    687
    84
    transformers
    1 / 14