NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 95059528 of 10,221 models

    Visual Question Answering

    amitha/mllava-baichuan2-en

    20
    transformers
    Visual Question Answering

    Pankaj121212/blip-2-fine-tuned

    20
    transformers
    Visual Question Answering

    andrewqian123/LLAMA_BATCH

    20
    Visual Question Answering

    Foreshhh/Qwen2-VL-7B-SafeRLHF

    20
    3
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-8x7B

    20
    3
    transformers
    Visual Question Answering

    GeorgyGUF/INFRL-Qwen2.5-VL-72B-Preview-q8-with-bf16-output-and-bf16-embedding.gguf

    20
    transformers
    Visual Question Answering

    OpenDataArena/MMFineReason-2B

    20
    8
    Visual Question Answering

    OpenMed/Ministral-3B-MedVL

    20
    2
    Video Classification

    bluebird089/videomae-base-finetuned-kinetics-finetuned-round2-v3

    20
    transformers
    Video Classification

    PergaZuZ/videomae-base-finetuned-lift-data-resize

    20
    transformers
    Video Classification

    Simma7/deepfake_model

    20
    Video Classification

    JackWong0911/videomae-base-finetuned-ucf101-subset

    20
    transformers
    Video Classification

    lmazzon70/videomae-base-short-finetuned-ssv2-finetuned-rwf2000-epochs8

    20
    transformers
    Video Classification

    LinboTTT/videomae-base-finetuned-emonet-subset

    20
    transformers
    Zero Shot Classification

    mjwong/e5-large-mnli-anli

    20
    transformers
    Visual Question Answering

    Foreshhh/Qwen2-VL-7B-VLGuard

    20
    1
    Unconditional Image Generation

    li-yan/sd-class-butterflies-64

    20
    diffusers
    Unconditional Image Generation

    uripper/GIANNIS

    20
    diffusers
    Video Classification

    microsoft/xclip-base-patch16-kinetics-600-16-frames

    20
    2
    transformers
    Video Classification

    OckerGui/videomae-base-finetuned-ESBD

    20
    transformers
    Zero Shot Classification

    DAMO-NLP-SG/zero-shot-classify-SSTuning-large

    20
    2
    transformers
    Zero Shot Classification

    glamprou/switch-base-8-mnli

    20
    transformers
    Zero Shot Classification

    asadfgglie/mDeBERTa-v3-base-xnli-multilingual-zeroshot-v3.0-only-non-nli

    20
    1
    Visual Question Answering

    MohamedTahir/ViLTVQA

    20
    transformers
    397 / 426