    Audio Classification Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    543 models available

    Showing 124 of 543 models

    Audio Classification

    laion/clap-htsat-fused

    15.4M
    107
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim

    628K
    170
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-24-ft-age-gender

    625K
    55
    transformers
    Audio Classification

    MIT/ast-finetuned-audioset-10-10-0.4593

    516K
    360
    transformers
    Audio Classification

    alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech

    513K
    47
    transformers
    Audio Classification

    m-a-p/MERT-v1-330M

    441K
    89
    transformers
    Audio Classification

    facebook/mms-lid-256

    391K
    18
    transformers
    Audio Classification

    OpenMuQ/MuQ-large-msd-iter

    354K
    24
    Audio Classification

    xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned

    318K
    15
    transformers
    Audio Classification

    speechbrain/emotion-recognition-wav2vec2-IEMOCAP

    309K
    188
    speechbrain
    Audio Classification

    onecxi/open-vakgyata

    286K
    3
    transformers
    Audio Classification

    m-a-p/MERT-v1-95M

    219K
    51
    transformers
    Audio Classification

    facebook/audiobox-aesthetics

    152K
    48
    Audio Classification

    prithivMLmods/Common-Voice-Gender-Detection

    150K
    30
    transformers
    Audio Classification

    superb/hubert-large-superb-er

    112K
    25
    transformers
    Audio Classification

    speechbrain/lang-id-voxlingua107-ecapa

    98K
    151
    speechbrain
    Audio Classification

    facebook/mms-lid-1024

    62K
    11
    transformers
    Audio Classification

    facebook/mms-lid-4017

    59K
    15
    transformers
    Audio Classification

    OpenMuQ/MuQ-MuLan-large

    56K
    22
    Audio Classification

    superb/wav2vec2-base-superb-er

    51K
    16
    transformers
    Audio Classification

    aufklarer/WeSpeaker-ResNet34-LM-MLX

    48K
    2
    mlx
    Audio Classification

    tiantiaf/wavlm-large-categorical-emotion

    46K
    4
    Audio Classification

    tiantiaf/wavlm-large-voice-quality

    46K
    4
    Audio Classification

    aufklarer/Qwen3-ForcedAligner-0.6B-4bit

    36K
    1
    mlx