    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 8689-8712 of 9,588 models

    Voice Activity Detection

    salmanshahid/segmentation

    31
    Voice Activity Detection

    videosdk-live/Namo-Turn-Detector-v1-English

    31
    onnxruntime
    Unconditional Image Generation

    rxliang/sd-class-butterflies-32

    31
    diffusers
    Tabular Classification

    SnowFlash383935/DigitalEduTransformers

    31
    1
    transformers
    Depth Estimation

    qualcomm/Depth-Anything-V3

    31
    2
    pytorch
    Video Classification

    marekk/video_soccer_goal_detection

    31
    transformers
    Video Classification

    dd00697/videomae-base-finetuned-ucf101-subset

    31
    transformers
    Video Classification

    Awais1718/videomae-base-finetuned-kinetics-finetuned-shoplifting-dataset

    31
    1
    transformers
    Video Classification

    irenetrecu/videomae-base-finetuned-conflab

    31
    transformers
    Video Classification

    adenhaus/videomae-small-finetuned-ssv2-finetuned-judo

    31
    transformers
    Video Classification

    sirishgam001/videomae-finetuned-engagenet-full

    31
    transformers
    Video Classification

    codircodir/videomae-base-finetuned-kinetics-finetuned-ucf101-subset-finetuned-N

    31
    transformers
    Video Classification

    SujitShelar/vjepa2-vitl-fpc16-256-hmdb51

    31
    1
    transformers
    Video Classification

    tayyabimam/Deepfake

    31
    3
    Zero Shot Classification

    onnx-community/distilbert-base-uncased-mnli-ONNX

    31
    1
    transformers.js
    Visual Question Answering

    DAMO-NLP-SG/VideoRefer-7B

    31
    5
    transformers
    Visual Question Answering

    Punthon/ic-luvkka

    31
    transformers
    Visual Question Answering

    Coobiw/InternLM-XComposer2_Enhanced

    31
    Visual Question Answering

    BranZhu/Qwen3-VL-2B-HotpotQA-SFT

    31
    Visual Question Answering

    Push2407/YOUR-REPO

    31
    transformers
    Unconditional Image Generation

    ym999ai/ddpm-celebahq-finetuned-butterflies-2epochs

    31
    diffusers
    Video Classification

    Parallax-labs-1/parallax_TEMPORAL-ValidPhone

    31
    pytorch
    Unconditional Image Generation

    Parallax-labs-1/parallax_VIDEO-Boxes

    31
    pytorch
    Video Classification

    facebook/timesformer-hr-finetuned-ssv2

    30
    2
    transformers
    Page 363 of 400