NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 74897512 of 9,588 models

    Audio Classification

    FireRedTeam/FireRedLID

    145
    13
    Object Detection

    mlx-community/YOLO26s-OptiQ-6bit

    145
    mlx
    Text To Video

    BestWishYsh/Helios-Mid

    144
    10
    diffusers
    Video Classification

    facebook/timesformer-hr-finetuned-k400

    143
    3
    transformers
    Video Classification

    steveice/videomae-base-finetuned-kinetics-finetuned-engine-subset-R2-20230417_K400_2

    143
    transformers
    Object Detection

    mustafakemal0146/playing-cards-yolov8

    143
    ultralytics
    Zero Shot Image Classification

    laion/CLIP-ViT-B-16-CommonPool.L.laion-s1B-b8K

    142
    open_clip
    Text To Video

    obsxrver/wan2.2-t2v-bdsm

    142
    8
    Object Detection

    dopaul/chess-piece-detector-merged-v2

    142
    1
    ultralytics
    Image Segmentation

    Xenova/deeplabv3-mobilevit-small

    141
    transformers.js
    Image Segmentation

    nvidia/NV-Segment-CT

    141
    17
    monai
    Audio Classification

    Xenova/ast-finetuned-speech-commands-v2

    141
    2
    transformers.js
    Audio Classification

    BKat/distilhubert-finetuned-gtzan

    141
    transformers
    Object Detection

    keremberke/yolov5m-csgo

    140
    1
    yolov5
    Object Detection

    mlx-community/YOLO26l-OptiQ-6bit

    140
    mlx
    Audio Classification

    Simma7/audio_model

    140
    transformers
    Audio Classification

    Adbhut/distilhubert-finetuned-gtzan

    140
    transformers
    Audio Classification

    BLakshmiVijay/xlsr-english

    140
    transformers
    Audio Classification

    0xb1/wav2vec2-base-finetuned-speech_commands-v0.02-finetuned-speech_commands-v0.02

    140
    transformers
    Audio Classification

    0xmagical/wavelm-clean

    140
    transformers
    Text To Audio

    AddisuSeteye/speecht5_tts_amharic2

    140
    transformers
    Text To Audio

    sil-ai/senga-LUK-aligned-speecht5

    140
    transformers
    Zero Shot Classification

    onnx-community/multilingual-MiniLMv2-L6-mnli-xnli-ONNX

    140
    transformers.js
    Unconditional Image Generation

    krasnova/ddpm_afhq_64

    139
    diffusers
    313 / 400