    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 9625-9648 of 10,221 models

    Visual Question Answering

    GeorgyGUF/INFRL-Qwen2.5-VL-72B-Preview-bf16.gguf

    18
    transformers
    Visual Question Answering

    TeeA/MATCHA-ViChart

    18
    transformers
    Unconditional Image Generation

    CCMat/ddpm-church-finetune-wikiart-256

    18
    diffusers
    Video Classification

    microsoft/xclip-base-patch16-ucf-16-shot

    18
    2
    transformers
    Video Classification

    IsraelSonseca/videomae-base-finetuned-ucf101_sport-subset

    18
    transformers
    Video Classification

    NiiCole/videomae-base-finetuned-ucf101-subset

    18
    transformers
    Video Classification

    hafidber/videomae-base-finetuned-ucf101-subset

    18
    transformers
    Video Classification

    Sapezb/ViVitTrained-ABAW

    18
    transformers
    Video Classification

    NiklasTUM/VideoMAEv2-Huge-finetuned-deception-dataset-mae-huge

    18
    1
    transformers
    Zero Shot Classification

    mjwong/gte-multilingual-base-xnli

    18
    Visual Question Answering

    NhatDFO/sf_blip2

    18
    1
    transformers
    Visual Question Answering

    jmonas/ViLT-33M-vqa

    18
    transformers
    Visual Question Answering

    Mediocreatmybest/blip2-flan-t5-xxl_8bit

    18
    2
    transformers
    Visual Question Answering

    Devops-hestabit/InternVL-chat

    18
    transformers
    Unconditional Image Generation

    benlehrburger/modern-architecture-32

    18
    1
    diffusers
    Unconditional Image Generation

    Dibyasha2023/ddpm-celebahq-finetuned-butterflies-2epochs

    18
    diffusers
    Voice Activity Detection

    philschmid/pyannote-segmentation

    17
    10
    pyannote-audio
    Voice Activity Detection

    Cactus-Compute/silero-vad

    17
    Voice Activity Detection

    BricksDisplay/ten-vad

    17
    Document Question Answering

    QUBUHUB/roberta-base-squad2-optimized

    17
    generic
    Document Question Answering

    nikravan/glm-4vq

    17
    36
    transformers
    Document Question Answering

    TusharGoel/LayoutLMv2-finetuned-docvqa

    17
    1
    transformers
    Document Question Answering

    Sharka/CIVQA_LayoutXLM

    17
    2
    transformers
    Document Question Answering

    frizwankhan/entity-linking-model-final

    17
    transformers
    Page 402 of 426