    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 9553-9576 of 10,221 models

    Visual Question Answering

    Yosemat/designvlm

    19
    1
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-72B

    19
    10
    transformers
    Visual Question Answering

    LeroyDyer/_Spydaz_Web_AI_LlavaNext

    19
    1
    transformers
    Visual Question Answering

    introvoyz041/Ministral-3B-MedVL-Q8_0-GGUF

    19
    Visual Question Answering

    gaoqie/Qwen2.5VL-7B-Instruct-fire

    19
    1
    Visual Question Answering

    amitha/mllava-llama2-en-zh

    19
    transformers
    Visual Question Answering

    r-g2-2024/Llama-3.1-70B-Instruct-multimodal-JP-Graph-v0.1

    19
    19
    Document Question Answering

    PrplHrt/LayoutLMv2_hub

    19
    transformers
    Document Question Answering

    PrimWong/layoutlmv2-base-uncased_finetuned_docvqa

    19
    transformers
    Video Classification

    bluebird089/videomae-small-finetuned-kinetics-finetuned-round2-v4

    19
    transformers
    Video Classification

    virkha/videomae-base-finetuned-ucf101-subset

    19
    transformers
    Video Classification

    ayushexel/vjepa2-384

    19
    transformers
    Zero Shot Classification

    sknow-lab/Qwen2.5-14B-CIC-ACLARC-GGUF

    19
    1
    transformers
    Zero Shot Classification

    dratima123/mDeBERTa-v3-base-mnli-xnli

    19
    Visual Question Answering

    zenlm/zen-designer-235b-a22b-thinking

    19
    1
    transformers
    Visual Question Answering

    Keetawan/BLIP2SeaLLMs-1.5B

    19
    transformers
    Visual Question Answering

    Keetawan/BLIP2SeaLLMs-1.5B_COCO

    19
    transformers
    Table Question Answering

    liuddf/tapex-base

    19
    Video Classification

    OckerGui/videomae-base-finetuned-ASBD_ESBD

    19
    transformers
    Video Classification

    OckerGui/videomae-base-finetuned-ESBD_Augm

    19
    transformers
    Video Classification

    NiiCole/vivit-b-16x2-kinetics400-PET-lora-vivit-01

    19
    transformers
    Video Classification

    Ham1mad1/videomae-base-Vsl-Lab-PC-V5

    19
    transformers
    Video Classification

    Leonlala/videomae-base-finetuned-ucf101-subset

    19
    transformers
    Zero Shot Classification

    emrecan/distilbert-base-turkish-cased-allnli_tr

    19
    1
    transformers
    Page 399 of 426