Audio
Speech to Text
Convert speech content to text with timestamps and confidence scores
Note: This playground provides simulated output to showcase functionality. No input data is processed or stored on our servers. Use this demo to explore the feature extractor's capabilities before integrating it into your application.
Input
Enter a URL to a audio file
Drag and drop a audio file here, or click to browse
The speech recognition model to use. Default: whisper-large
Language of the audio content. Default: auto
Output
{"transcript": {"type": "string"},"segments": {"type": "array","items": {"type": "object","properties": {"text": {"type": "string"},"start": {"type": "number"},"end": {"type": "number"},"confidence": {"type": "number"}}}}}