NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    10,221 models available

    Showing 94579480 of 10,221 models

    Zero Shot Classification

    paulhindemith/fasttext-classification

    21
    transformers
    Visual Question Answering

    BUAADreamer/Yi-VL-34B-hf

    21
    5
    transformers
    Visual Question Answering

    SwordElucidator/MiniCPM-Llama3-V-2_5

    21
    transformers
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base

    21
    1
    transformers
    Visual Question Answering

    TeeA/DONUT-ViChart

    21
    1
    transformers
    Visual Question Answering

    azwierzc/vilt-b32-finetuned-vqa-pl

    21
    transformers
    Video Classification

    RodrigoFardin/videomae-base-finetuned-dd

    21
    transformers
    Video Classification

    Shawon16/Timesformer_default_fold_10_10_epoch_noAug_batch8_codecheck

    21
    transformers
    Video Classification

    Shawon16/Timesformer_WLASL_100_200_epochs_p20_SR_16

    21
    transformers
    Video Classification

    awnw/videomae-base-finetuned-ucf101-subset

    21
    transformers
    Video Classification

    HaileyJu/videomae-base-finetuned-ucf101-subset-SBDtoy

    21
    transformers
    Video Classification

    Shawon16/VideoMAE_Base_WLASL_100_200_epochs_p20_SR_8

    21
    transformers
    Video Classification

    Mayank1996/videomae-base-finetuned-ucf101-subset

    21
    transformers
    Video Classification

    DanJoshua/student_videomobilevit_dist_kl_temp_1_alpha_0.6_teacher_mvit_v2_s_RWF2000

    21
    transformers
    Video Classification

    VidaRoha/videomae-base-finetuned-kinetics-transferred-traffic-subset

    21
    transformers
    Video Classification

    LinStevenn/videomae-base-readminds-assignment

    21
    transformers
    Video Classification

    rickysk/videomae-base-ipm_all_videos_gb2

    21
    transformers
    Zero Shot Classification

    knowledgator/gliclass-qwen-1.5B-v1.0

    21
    2
    Visual Question Answering

    DAMO-NLP-SG/VideoLLaMA2-7B-Base

    21
    6
    transformers
    Unconditional Image Generation

    shellypeng/atomixl_realistic

    21
    diffusers
    Unconditional Image Generation

    KushalRamaiya/sd-class-butterflies-32

    21
    diffusers
    Unconditional Image Generation

    eurecom-ds/scoresdeve-ema-multi-dsprites-64

    21
    diffusers
    Video Classification

    Julia0408/videomae-base-finetuned-ucf101-subset

    21
    transformers
    Video Classification

    NiiCole/vivit-b-16x2-kinetics400-finetuned-ucf101-subset1

    21
    transformers
    395 / 426