NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 86658688 of 9,588 models

    Voice Activity Detection

    Alkd/Silero-VAD-v5-CoreML

    32
    Visual Question Answering

    RhapsodyAI/minicpm-guidance

    32
    7
    transformers
    Unconditional Image Generation

    ayushshah/beta-vae-capacity-annealing-celeba

    32
    Unconditional Image Generation

    KiranKSaravana/sd-class-butterflies-32-copy-5

    32
    diffusers
    Unconditional Image Generation

    mahir123456/sd-class-butterflies-32-copy-5

    32
    diffusers
    Depth Estimation

    onnx-community/metric3d-vit-small

    32
    2
    transformers.js
    Depth Estimation

    onnx-community/DepthPro-ONNX

    32
    14
    transformers.js
    Video Classification

    CAIR-HKISI/SurgMotion

    32
    5
    pytorch
    Video Classification

    anirudhmu/videomae-base-finetuned-soccer-action-recognition

    32
    2
    transformers
    Video Classification

    TanAlexanderlz/RALL_RGBCROP_Aug16F-constant

    32
    transformers
    Video Classification

    TanAlexanderlz/UCF_NoCrop-Aug-4B16F_2

    32
    transformers
    Video Classification

    Sathwik-kom/anomaly-detector-videomae

    32
    transformers
    Video Classification

    Shuchan/videomae-base-finetuned-ucf101-subset

    32
    transformers
    Video Classification

    ashishgimekar/shot_model

    32
    transformers
    Video Classification

    agasta/scarlet

    32
    transformers
    Video Classification

    CodyOnce/videomae-base-finetuned-ucf101-subset

    32
    transformers
    Video Classification

    anirudhmu/videomae-base-finetuned-soccer-action-recognition3

    32
    transformers
    Text To Audio

    Somali-tts/somali_tts_model

    32
    transformers
    Zero Shot Classification

    AntoineBlanot/roberta-nli

    32
    transformers
    Zero Shot Classification

    DAMO-NLP-SG/zero-shot-classify-SSTuning-base

    32
    9
    transformers
    Visual Question Answering

    Puuje/bdaalt

    32
    Visual Question Answering

    MariaK/vilt_finetuned_100

    32
    transformers
    Unconditional Image Generation

    gnicob/ddpm-celebahq-finetuned-butterflies-2epochs

    32
    diffusers
    Tabular Classification

    reddysama/gnaninet-fraud-classifier

    32
    1
    numpy
    362 / 400