NEWVectors or files. Pick a path.Start →

    Visual Question Answering Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    273 models available

    Showing 2548 of 273 models

    Visual Question Answering

    ricoh-ai/Qwen-3-VL-Ricoh-8B-20260227

    581
    14
    Visual Question Answering

    internlm/internlm-xcomposer2d5-7b

    554
    210
    transformers
    Visual Question Answering

    microsoft/git-base-textvqa

    538
    7
    transformers
    Visual Question Answering

    prithivMLmods/OpenMed-SynthVision-MedVL-AIO-GGUF

    512
    3
    transformers
    Visual Question Answering

    Swicked86/phi4-mm-gguf

    473
    3
    gguf
    Visual Question Answering

    mradermacher/CoE-SlideVQA-8B-i1-GGUF

    457
    transformers
    Visual Question Answering

    google/matcha-plotqa-v2

    401
    13
    transformers
    Visual Question Answering

    mradermacher/CoE-Wiki-CoE-8B-i1-GGUF

    392
    transformers
    Visual Question Answering

    introvoyz041/OpenMed-SynthVision-MedVL-AIO-GGUF

    360
    transformers
    Visual Question Answering

    SimulaMet/Qwen2.5-VL-KvasirVQA-x1-ft

    346
    peft
    Visual Question Answering

    second-state/MiniCPM-V-4-GGUF

    343
    1
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-7B

    318
    40
    transformers
    Visual Question Answering

    openbmb/MiniCPM-Llama3-V-2_5-int4

    310
    80
    transformers
    Visual Question Answering

    second-state/MiniCPM-Llama3-V-2_5-GGUF

    296
    1
    Visual Question Answering

    mradermacher/CoE-SlideVQA-8B-GGUF

    277
    transformers
    Visual Question Answering

    erax-ai/EraX-VL-2B-V1.5

    236
    10
    transformers
    Visual Question Answering

    gaianet/MiniCPM-V-4_5-GGUF

    230
    4
    Visual Question Answering

    mradermacher/TreeVGR-7B-CI-i1-GGUF

    229
    1
    transformers
    Visual Question Answering

    gaianet/MiniCPM-V-4-GGUF

    216
    Visual Question Answering

    google/pix2struct-docvqa-large

    210
    33
    transformers
    Visual Question Answering

    mPLUG/mPLUG-Owl3-7B-241101

    201
    10
    Visual Question Answering

    mradermacher/CoE-Wiki-CoE-8B-GGUF

    201
    transformers
    Visual Question Answering

    google/pix2struct-widget-captioning-base

    200
    6
    transformers
    Visual Question Answering

    ybelkada/blip2-opt-2.7b-fp16-sharded

    189
    3
    transformers
    2 / 12