87dl/month
Identifier
Model ID
hanxunh/AudioMosaic-vit-b16-finetune-as2mTags
AudioMosaicsafetensorsarxiv:2605.14231audioaudio-classificationaudiosetself-supervised-learningbase_model:hanxunh/AudioMosaic-vit-b16-pretrainedbase_model:finetune:hanxunh/AudioMosaic-vit-b16-pretrainedlicense:mitmodel-indexregion:us
Use AudioMosaic-vit-b16-finetune-as2m on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
Organizationhanxunh
TaskAudio Classification
LibraryAudioMosaic
Licensemit
Downloads/mo87
View on HuggingFace
See model card, files, and community discussion
Related Audio Classification Models
laion/clap-htsat-fused
20.9M
audeering/wav2vec2-large-robust-24-ft-age-gender
1.5M
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim
880K
speechbrain/emotion-recognition-wav2vec2-IEMOCAP
603K
OpenMuQ/MuQ-large-msd-iter
347K
xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned
331K
MIT/ast-finetuned-audioset-10-10-0.4593
317K
onecxi/open-vakgyata
312K