82dl/month
Identifier
Model ID
hanxunh/AudioMosaic-vit-b16-finetune-esc-split3Tags
AudioMosaicsafetensorsarxiv:2605.14231audioaudio-classificationesc50self-supervised-learningbase_model:hanxunh/AudioMosaic-vit-b16-pretrainedbase_model:finetune:hanxunh/AudioMosaic-vit-b16-pretrainedlicense:mitmodel-indexregion:us
Use AudioMosaic-vit-b16-finetune-esc-split3 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
Organizationhanxunh
TaskAudio Classification
LibraryAudioMosaic
Licensemit
Downloads/mo82
View on HuggingFace
See model card, files, and community discussion
Related Audio Classification Models
laion/clap-htsat-fused
20.9M
audeering/wav2vec2-large-robust-24-ft-age-gender
1.5M
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim
880K
speechbrain/emotion-recognition-wav2vec2-IEMOCAP
603K
OpenMuQ/MuQ-large-msd-iter
347K
xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned
331K
MIT/ast-finetuned-audioset-10-10-0.4593
317K
onecxi/open-vakgyata
312K