NEWAgents can now see video via MCP.Try it now →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    202 models available

    Showing 121144 of 202 models

    Visual Question Answering

    YifanQiao/qwen3vl4b-hist-qa-checkpoint-723

    40
    peft
    Visual Question Answering

    compling/MiniCPM-V-2

    39
    1
    Visual Question Answering

    RogerFerrod/GroundSet-LLaVA-1.6-7B

    39
    4
    Visual Question Answering

    meituan/MemOCR-7B

    38
    7
    Visual Question Answering

    google/matcha-chart2text-pew

    37
    40
    transformers
    Visual Question Answering

    google/matcha-plotqa-v1

    37
    3
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

    37
    8
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoRefer-7B

    37
    5
    transformers
    Visual Question Answering

    google/pix2struct-infographics-vqa-large

    35
    12
    transformers
    Visual Question Answering

    google/pix2struct-screen2words-large

    35
    22
    transformers
    Visual Question Answering

    DeclanBracken/MiniCPM-Llama3-V-2.5-Transcriptor

    35
    transformers
    Visual Question Answering

    gaoqie/Glm-Edge-V-5B-fire

    34
    1
    Visual Question Answering

    SwordElucidator/MiniCPM-Llama3-V-2_5-int4

    32
    1
    transformers
    Visual Question Answering

    google/pix2struct-infographics-vqa-base

    31
    9
    transformers
    Visual Question Answering

    HFatemeH/vilt_finetuned_200

    31
    1
    transformers
    Visual Question Answering

    MahimaNR/vilt_finetuned_200

    31
    transformers
    Visual Question Answering

    BUAADreamer/Yi-VL-34B-hf

    31
    5
    transformers
    Visual Question Answering

    Coobiw/InternLM-XComposer2_Enhanced

    30
    Visual Question Answering

    Punthon/ic-luvkka

    29
    transformers
    Visual Question Answering

    Puuje/bdaalt

    29
    Visual Question Answering

    BranZhu/Qwen3-VL-2B-HotpotQA-SFT

    29
    Visual Question Answering

    nhattan9999t/blip-kvasir-vqa

    28
    1
    transformers
    Visual Question Answering

    DeclanBracken/MiniCPM-Llama3-V-2_5-Transcriptor-V3

    28
    transformers
    Visual Question Answering

    Push2407/YOUR-REPO

    28
    transformers
    6 / 9