273 downloads/month
Identifier
Model ID
Simon-Kotchou/ssast-base-patch-audioset-16-16
Tags
transformers, safetensors, audio-spectrogram-transformer, audio-classification, dataset:agkphysics/AudioSet, dataset:openslr/librispeech_asr, arxiv:2110.09784, license:bsd-3-clause, endpoints_compatible, deploy:azure, region:us
Use ssast-base-patch-audioset-16-16 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Specification
Organization: Simon-Kotchou
Task: Audio Classification
Library: transformers
License: bsd-3-clause
Downloads/mo: 273
View on Hugging Face: see the model card, files, and community discussion.
Related Audio Classification Models
laion/clap-htsat-fused: 16.5M
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim: 887K
jakeBland/wav2vec-vm-finetune: 883K
speechbrain/emotion-recognition-wav2vec2-IEMOCAP: 552K
MIT/ast-finetuned-audioset-10-10-0.4593: 521K
audeering/wav2vec2-large-robust-24-ft-age-gender: 448K
dima806/music_genres_classification: 321K
xbgoose/hubert-large-speech-emotion-recognition-russian-dusha-finetuned: 284K