12Kdl/month
8likes
Identifier
Model ID
espnet/owsm_ctc_v4_1BTags
espnetaudioautomatic-speech-recognitionspeech-translationlanguage-identificationmultilingualdataset:espnet/yodas_owsmv4arxiv:2406.09282arxiv:2401.16658arxiv:2309.13876license:cc-by-4.0eval-resultsregion:us
Use owsm_ctc_v4_1B on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationespnet
TaskAutomatic Speech Recognition
Libraryespnet
Licensecc-by-4.0
Downloads/mo12K
Likes8
View on HuggingFace
See model card, files, and community discussion
Related Automatic Speech Recognition Models
pyannote/speaker-diarization-3.1
10.2M
argmaxinc/whisperkit-coreml
8.1M
openai/whisper-large-v3-turbo
7.0M
openai/whisper-large-v3
4.9M
jonatasgrosman/wav2vec2-large-xlsr-53-russian
4.9M
jonatasgrosman/wav2vec2-large-xlsr-53-portuguese
3.8M
MahmoudAshraf/mms-300m-1130-forced-aligner
3.7M
pyannote/voice-activity-detection
2.7M