diar_streaming_sortformer_4spk-v2.1

Name: diar_streaming_sortformer_4spk-v2.1
Author: nvidia

by nvidia

19Kdl/month

64likes

HuggingFace Use in Pipeline

Identifier

Model ID

nvidia/diar_streaming_sortformer_4spk-v2.1

Use diar_streaming_sortformer_4spk-v2.1 on Mixpeek

Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

Open Pipeline Builder

Specification

Organizationnvidia

TaskAutomatic Speech Recognition

Librarynemo

Licenseother

Downloads/mo19K

Likes64

View on HuggingFace

See model card, files, and community discussion

Related Automatic Speech Recognition Models

pyannote/speaker-diarization-3.1

10.2M

argmaxinc/whisperkit-coreml

8.1M

openai/whisper-large-v3-turbo

7.0M

openai/whisper-large-v3

4.9M

jonatasgrosman/wav2vec2-large-xlsr-53-russian

4.9M

jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

3.8M

MahmoudAshraf/mms-300m-1130-forced-aligner

3.7M

pyannote/voice-activity-detection

2.7M

diar_streaming_sortformer_4spk-v2.1

Tags

Use diar_streaming_sortformer_4spk-v2.1 on Mixpeek

Specification

View on HuggingFace

Related Automatic Speech Recognition Models