
    AI Model Hub

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    9,588 models available

    Showing 7825–7848 of 9,588 models

    Object Detection

    Mike-Alem/License-Plate-Detection-First-Project

    107
    Image Segmentation

    yolo12138/segformer-b2-cloth-parse-9

    107
    7
    transformers
    Image Segmentation

    stevenbucaille/rf-detr-seg-preview

    107
    transformers
    Zero Shot Image Classification

    visheratin/nllb-clip-base-oc

    107
    2
    open_clip
    Image Feature Extraction

    facebook/webssl-dino2b-full2b-224

    107
    transformers
    Audio Classification

    Xenova/wav2vec2-large-xlsr-53-gender-recognition-librispeech

    107
    1
    transformers.js
    Audio Classification

    ANANDAPADMANABHANANS/deepfake-audio-detector-v9

    107
    transformers
    Audio Classification

    ardneebwar/wav2vec2-animal-sounds-finetuned-hubert-finetuned-animals

    107
    10
    transformers
    Audio Classification

    aoussou/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan

    107
    transformers
    Image Segmentation

    nielsr/segformer-b0-finetuned-segments-sidewalk

    107
    transformers
    Object Detection

    Xenova/yolov9-c

    106
    6
    transformers.js
    Object Detection

    baselefre/objectdetectionaugmentedclean

    106
    Object Detection

    JadeRay-42/MonoFDETR

    106
    Object Detection

    Charliesgt/pollen_detr_resnet50_benchmark

    106
    transformers
    Image Segmentation

    qualcomm/FastSam-S

    106
    8
    pytorch
    Audio Classification

    amirahmadian16/sl_persian_ser_with_gwo_and_hubert

    106
    transformers
    Audio Classification

    alidenewade/ast-finetuned-audioset-10-10-0.4593-finetuned-gtzan

    106
    transformers
    Audio Classification

    Mrsmetamorphosis/dementia-wav2vec-scientific-specaugment-V3

    106
    transformers
    Text To Audio

    Marvis-AI/marvis-tts-100m-v0.2-MLX-6bit

    106
    4
    transformers
    Audio To Audio

    HiDolen/Mini-BS-RoFormer

    106
    1
    transformers
    Image Feature Extraction

    MFY111/dinov3-vits16

    105
    transformers
    Audio Classification

    Jainam/audio_classify_v1

    105
    transformers
    Audio Classification

    firdhokk/speech-emotion-recognition-with-facebook-wav2vec2-large-xlsr-53

    105
    transformers
    Text To Audio

    zhangj1an/AudioX

    105
    diffusers
    Page 327 of 400