NEWVectors or files. Pick a path.Start →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    273 models available

    Showing 124 of 273 models

    Visual Question Answering

    Salesforce/blip-vqa-base

    337K
    194
    transformers
    Visual Question Answering

    openbmb/MiniCPM-V-2

    73K
    497
    transformers
    Visual Question Answering

    dandelin/vilt-b32-finetuned-vqa

    60K
    422
    transformers
    Visual Question Answering

    google/deplot

    36K
    317
    transformers
    Visual Question Answering

    TIGER-Lab/VideoScore2

    27K
    3
    Visual Question Answering

    Salesforce/blip-vqa-capfilt-large

    20K
    54
    transformers
    Visual Question Answering

    UII-AI/uAI-NEXUS-MedVLM-1.0a-7B-RL

    13K
    13
    Visual Question Answering

    chaoyinshe/llava-med-v1.5-mistral-7b-hf

    3K
    6
    Visual Question Answering

    google/pix2struct-docvqa-base

    3K
    44
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2.1-7B-AV

    2K
    16
    transformers
    Visual Question Answering

    google/matcha-chartqa

    2K
    47
    transformers
    Visual Question Answering

    openbmb/MiniCPM-V

    2K
    207
    transformers
    Visual Question Answering

    microsoft/git-large-vqav2

    1K
    19
    transformers
    Visual Question Answering

    google/matcha-base

    1K
    29
    transformers
    Visual Question Answering

    second-state/MiniCPM-V-2_6-GGUF

    1K
    5
    Visual Question Answering

    google/pix2struct-ai2d-base

    1K
    43
    transformers
    Visual Question Answering

    microsoft/git-base-vqav2

    1K
    21
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-vl-7b

    904
    84
    transformers
    Visual Question Answering

    google/pix2struct-chartqa-base

    881
    10
    transformers
    Visual Question Answering

    second-state/MiniCPM-V-4_5-GGUF

    777
    14
    Visual Question Answering

    mradermacher/Supertron-VL-2B-GGUF

    627
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-4khd-7b

    606
    73
    transformers
    Visual Question Answering

    mradermacher/MemOCR-7B-i1-GGUF

    603
    1
    transformers
    Visual Question Answering

    erax-ai/EraX-VL-7B-V2.0-Preview

    590
    27
    transformers
    1 / 12