    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    273 models available

    Showing 49-72 of 273 models

    Visual Question Answering

    gaianet/MiniCPM-Llama3-V-2_5-GGUF

    188
    3
    Visual Question Answering

    RogerFerrod/GroundSet-LLaVA-1.6-7B

    184
    6
    Visual Question Answering

    Lin-Chen/sharegpt4video-8b

    183
    45
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2-vl-1_8b

    177
    18
    transformers
    Visual Question Answering

    mradermacher/TreeVGR-7B-CI-GGUF

    175
    1
    transformers
    Visual Question Answering

    BUAADreamer/Yi-VL-6B-hf

    175
    2
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

    170
    8
    transformers
    Visual Question Answering

    internlm/internlm-xcomposer2d5-7b-4bit

    168
    13
    transformers
    Visual Question Answering

    google/matcha-chart2text-pew

    164
    40
    transformers
    Visual Question Answering

    mradermacher/MemOCR-7B-GGUF

    158
    1
    transformers
    Visual Question Answering

    mradermacher/NayanaVQA-GGUF

    147
    transformers
    Visual Question Answering

    erax-ai/EraX-VL-7B-V1.5

    140
    9
    transformers
    Visual Question Answering

    omlab/VLM-R1-Qwen2.5VL-3B-Math-0305

    128
    8
    Visual Question Answering

    gaianet/MiniCPM-V-2_6-GGUF

    124
    Visual Question Answering

    RussRobin/SpatialBot-3B

    111
    19
    transformers
    Visual Question Answering

    OpenMed/Qwen2.5-3B-MedVL

    111
    2
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-7B-16F

    107
    14
    transformers
    Visual Question Answering

    prapaa/eastrus-vl-qwen3-8b

    100
    Visual Question Answering

    wumengyangok/LLaVA-SpaceSGG

    99
    Visual Question Answering

    openbmb/OmniLMM-12B

    95
    73
    transformers
    Visual Question Answering

    Jesteban247/brats_medgemma-GGUF

    94
    transformers
    Visual Question Answering

    microsoft/git-large-textvqa

    93
    6
    transformers
    Visual Question Answering

    google/matcha-chart2text-statista

    80
    10
    transformers
    Visual Question Answering

    google/pix2struct-screen2words-base

    80
    25
    transformers