    Audio Classification Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    470 models available

    Showing 124 of 470 models

    Audio Classification

    laion/clap-htsat-fused

    20.9M
    103
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-24-ft-age-gender

    1.5M
    53
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim

    880K
    167
    transformers
    Audio Classification

    speechbrain/emotion-recognition-wav2vec2-IEMOCAP

    603K
    188
    speechbrain
    Audio Classification

    OpenMuQ/MuQ-large-msd-iter

    347K
    23
    Audio Classification

    xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned

    331K
    15
    transformers
    Audio Classification

    MIT/ast-finetuned-audioset-10-10-0.4593

    317K
    357
    transformers
    Audio Classification

    onecxi/open-vakgyata

    312K
    3
    transformers
    Audio Classification

    facebook/mms-lid-256

    286K
    17
    transformers
    Audio Classification

    facebook/audiobox-aesthetics

    231K
    48
    Audio Classification

    prithivMLmods/Common-Voice-Gender-Detection

    180K
    28
    transformers
    Audio Classification

    alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech

    176K
    47
    transformers
    Audio Classification

    aufklarer/WeSpeaker-ResNet34-LM-MLX

    147K
    2
    mlx
    Audio Classification

    OpenMuQ/MuQ-MuLan-large

    121K
    21
    Audio Classification

    speechbrain/lang-id-voxlingua107-ecapa

    120K
    149
    speechbrain
    Audio Classification

    superb/hubert-large-superb-er

    87K
    25
    transformers
    Audio Classification

    m-a-p/MERT-v1-95M

    71K
    50
    transformers
    Audio Classification

    facebook/mms-lid-1024

    67K
    11
    transformers
    Audio Classification

    audeering/wav2vec2-large-robust-6-ft-age-gender

    63K
    6
    transformers
    Audio Classification

    m-a-p/MERT-v1-330M

    57K
    88
    transformers
    Audio Classification

    superb/wav2vec2-base-superb-er

    51K
    16
    transformers
    Audio Classification

    aufklarer/Qwen3-ForcedAligner-0.6B-4bit

    50K
    1
    mlx
    Audio Classification

    Jzuluaga/accent-id-commonaccent_ecapa

    48K
    18
    speechbrain
    Audio Classification

    firdhokk/speech-emotion-recognition-with-openai-whisper-large-v3

    44K
    111
    transformers