NEWAgents can now see video via MCP.Try it now →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    202 models available

    Showing 169192 of 202 models

    Visual Question Answering

    azwierzc/vilt-b32-finetuned-vqa-pl

    19
    transformers
    Visual Question Answering

    TeeA/DONUT-ViChart

    19
    1
    transformers
    Visual Question Answering

    bgyoo/vilt_finetuned_200

    19
    transformers
    Visual Question Answering

    andrewqian123/LLAMA_BATCH

    19
    Visual Question Answering

    UBC-NLP/dallah

    19
    3
    Visual Question Answering

    Foreshhh/Qwen2-VL-7B-SafeRLHF

    19
    3
    Visual Question Answering

    Maria-pro/my_vqa_model

    19
    transformers
    Visual Question Answering

    gaoqie/Qwen2.5VL-7B-Instruct-fire

    19
    1
    Visual Question Answering

    0xAmey/tinyllava-1.1b-v0.1

    18
    21
    transformers
    Visual Question Answering

    OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B

    18
    7
    transformers
    Visual Question Answering

    ManishThota/InstructVQA

    18
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-8x7B

    18
    3
    transformers
    Visual Question Answering

    PengxiangLi/MAT

    18
    2
    Visual Question Answering

    convexray/deplot

    18
    Visual Question Answering

    mattia-re-learn/llava-v1.5-13b

    18
    transformers
    Visual Question Answering

    google/pix2struct-ocrvqa-large

    17
    34
    transformers
    Visual Question Answering

    hf-tiny-model-private/tiny-random-Blip2ForConditionalGeneration

    17
    transformers
    Visual Question Answering

    amitha/mllava-llama2-en-zh

    17
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-72B-Base

    17
    1
    transformers
    Visual Question Answering

    TIGER-Lab/VL-Reasoner-7B

    17
    1
    transformers
    Visual Question Answering

    r-g2-2024/Llama-3.1-70B-Instruct-multimodal-JP-Graph-v0.1

    17
    19
    Visual Question Answering

    mncai/hunmin_vlm_235b_v0.11_merged_cua

    17
    3
    transformers
    Visual Question Answering

    OpenDataArena/MMFineReason-2B

    17
    8
    Visual Question Answering

    IDEA-CCNL/Ziya-BLIP2-14B-Visual-v1

    17
    58
    transformers
    8 / 9