NEWWhy single embeddings fail for video.Read the post →

    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 76097632 of 9,588 models

    Object Detection

    keremberke/yolov5n-forklift

    127
    1
    yolov5
    Object Detection

    mderyaguler/detr-resnet-50-dc5-fashionpedia-finetuned

    127
    transformers
    Object Detection

    hafsa101010/practica_2

    127
    transformers
    Image Feature Extraction

    py-feat/retinaface

    127
    7
    py-feat
    Text To Video

    Kotajiro/LTX23-ruri_LoRA

    127
    2
    diffusers
    Audio Classification

    Xenova/ast-finetuned-audioset-16-16-0.442

    127
    transformers.js
    Audio Classification

    MarekCech/GenreVim-Music-Classification-DistilHuBERT

    127
    transformers
    Zero Shot Classification

    cmarkea/bloomz-560m-nli

    127
    1
    transformers
    Video Classification

    Shawon16/videoMAE_base_wlasl_100_50ep_coR_p10

    127
    transformers
    Visual Question Answering

    JosephDefonse/Med3DVLM-PMCT

    126
    Table Question Answering

    QuantFactory/TableLLM-13b-GGUF

    126
    5
    transformers
    Object Detection

    leeyunjai/yolo11-firedetect

    126
    6
    ultralytics
    Image Segmentation

    Xenova/deeplabv3-mobilevit-x-small

    126
    transformers.js
    Image Feature Extraction

    timm/convnext_base.clip_laiona

    126
    timm
    Image Feature Extraction

    amildravid4292/clip-vitb16-test-time-registers

    126
    2
    transformers
    Audio Classification

    aoliveira/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan

    126
    transformers
    Audio Classification

    circulus/canvers-sound-event-v1

    126
    transformers
    Audio Classification

    jananiramaseshan/ast-genre-classifier-frozen

    126
    transformers
    Text To Audio

    wide-video/musicgen-small-v1.0.0

    126
    transformers.js
    Audio To Audio

    Ademola265/Qwen3-TTS-Tokenizer-12Hz

    125
    Object Detection

    EFFGRP/yolov11n-warehouse-pallets-640

    125
    ultralytics
    Zero Shot Image Classification

    joaodaniel/RS-M-CLIP

    125
    2
    open_clip
    Audio Classification

    Bhaveen/Musical-Instrument-Classification

    125
    1
    transformers
    Audio Classification

    Xenova/discogs-maest-30s-pw-73e-ts

    125
    transformers.js
    318 / 400