Audio

Speaker Diarization

Identify and separate different speakers in audio content

320K runs

Note: This playground provides simulated output to showcase functionality. No input data is processed or stored on our servers. Use this demo to explore the feature extractor's capabilities before integrating it into your application.

Input

File URL string

Enter a URL to a audio file

Upload audio

Drag and drop a audio file here, or click to browse

Select File

# model string

The speaker diarization model to use. Default: pyannote

# min_speakers integer

Minimum number of speakers to detect. Default: 1

# max_speakers integer

Maximum number of speakers to detect. Default: 5

Output

{
  "segments": {
    "type": "array",
    "items": {
      "type": "object",
      "properties": {
        "speaker_id": {
          "type": "string"
        },
        "start": {
          "type": "number"
        },
        "end": {
          "type": "number"
        },
        "confidence": {
          "type": "number"
        }
      }
    }
  }
}

Ready to run Speaker Diarization on your data? Spin it up in Studio — no infra to host.

Run this in Studio

Already have embeddings? Skip extraction — search your own vectors with MVS. First 1M vectors free.

Try MVS →