NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 89298952 of 9,588 models

    Zero Shot Classification

    mjwong/multilingual-e5-base-xnli-anli

    20
    transformers
    Zero Shot Classification

    gincioks/smartshot-zeroshot-finetuned-v0.2.0

    20
    Zero Shot Classification

    cmarkea/bloomz-3b-nli

    20
    1
    transformers
    Zero Shot Classification

    cublya/bge-m3-zeroshot-v2.0

    20
    transformers
    Visual Question Answering

    erax-ai/EraX-VL-7B-V2.0-Preview

    20
    27
    transformers
    Visual Question Answering

    amitha/mllava-baichuan2-en

    20
    transformers
    Visual Question Answering

    ZGZzz/SAME

    20
    same
    Visual Question Answering

    Pankaj121212/blip-2-fine-tuned

    20
    transformers
    Visual Question Answering

    andrewqian123/LLAMA_BATCH

    20
    Zero Shot Classification

    AntoineBlanot/flan-t5-xxl-classif-3way

    20
    3
    transformers
    Video Classification

    microsoft/xclip-base-patch16-hmdb-2-shot

    20
    transformers
    Video Classification

    bluebird089/videomae-base-finetuned-kinetics-finetuned-round2-v3

    20
    transformers
    Video Classification

    Mayank1996/videomae-base-finetuned-ucf101-subset

    20
    transformers
    Video Classification

    DanJoshua/student_videomobilevit_dist_kl_temp_1_alpha_0.6_teacher_mvit_v2_s_RWF2000

    20
    transformers
    Video Classification

    PergaZuZ/videomae-base-finetuned-lift-data-resize

    20
    transformers
    Video Classification

    Peregalli/videomae-base-finetuned-ucf101-subset

    20
    transformers
    Video Classification

    Simma7/deepfake_model

    20
    Unconditional Image Generation

    uripper/GIANNIS

    20
    diffusers
    Unconditional Image Generation

    CCMat/ddpm-church-finetune-wikiart-256

    20
    diffusers
    Voice Activity Detection

    aitytech/Silero-VAD-v5-MLX

    19
    mlx
    Voice Activity Detection

    BricksDisplay/silero-vad

    19
    Document Question Answering

    Sharka/CIVQA_DVQA_LayoutXLM

    19
    transformers
    Document Question Answering

    ashaduzzaman/layoutlmv2-base-uncased_finetuned_docvqa

    19
    transformers
    Table Question Answering

    alinh1803/opt-350-fine-tuning

    19
    transformers
    373 / 400