114dl/month
2likes
Identifier
Model ID
gaunernst/vit_base_patch16_1024_128.audiomae_as2m_ft_as20kTags
timmpytorchsafetensorsaudio-classificationarxiv:2207.06405license:cc-by-4.0region:us
Use vit_base_patch16_1024_128.audiomae_as2m_ft_as20k on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationgaunernst
TaskAudio Classification
Librarytimm
Licensecc-by-4.0
Downloads/mo114
Likes2
View on HuggingFace
See model card, files, and community discussion
Related Audio Classification Models
laion/clap-htsat-fused
16.5M
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim
887K
jakeBland/wav2vec-vm-finetune
883K
speechbrain/emotion-recognition-wav2vec2-IEMOCAP
552K
MIT/ast-finetuned-audioset-10-10-0.4593
521K
audeering/wav2vec2-large-robust-24-ft-age-gender
448K
dima806/music_genres_classification
321K
xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned
284K