6K downloads/month
2 likes
Identifier
Model ID: gaunernst/vit_base_patch16_1024_128.audiomae_as2m_ft_as20k
Tags: timm, pytorch, safetensors, audio-classification, arxiv:2207.06405, license:cc-by-4.0, region:us
Use vit_base_patch16_1024_128.audiomae_as2m_ft_as20k on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Specification
Organization: gaunernst
Task: Audio Classification
Library: timm
License: cc-by-4.0
Downloads/mo: 6K
Likes: 2
View on HuggingFace
See model card, files, and community discussion
Related Audio Classification Models
laion/clap-htsat-fused: 20.9M
audeering/wav2vec2-large-robust-24-ft-age-gender: 1.5M
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim: 880K
speechbrain/emotion-recognition-wav2vec2-IEMOCAP: 603K
OpenMuQ/MuQ-large-msd-iter: 347K
xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned: 331K
MIT/ast-finetuned-audioset-10-10-0.4593: 317K
onecxi/open-vakgyata: 312K