AudioSentimentConverter
Analyze the sentiment and emotional tone of audio recordings by combining speech transcription with acoustic feature analysis. Detects positive, negative, and neutral sentiment at utterance and segment levels, with additional emotion classification for anger, joy, frustration, and more.
How It Works
Upload an audio file or provide a URL to the Mixpeek API.
The audio is transcribed with speaker diarization and utterance-level segmentation.
Lexical sentiment is analyzed from the transcript text using an NLP model.
Acoustic sentiment is analyzed from vocal features including pitch, energy, speaking rate, and tone.
Lexical and acoustic scores are fused into a combined sentiment and emotion profile per segment.
Code Examples
from mixpeek import Mixpeekclient = Mixpeek(api_key="YOUR_API_KEY")result = client.convert(source="https://example.com/support-call.mp3",from_format="audio",to_format="sentiment",options={"speaker_diarization": True,"include_emotions": True,"include_trajectory": True,"interval_seconds": 30})print(f"Overall sentiment: {result.overall.label} ({result.overall.score:.2f})")for speaker in result.speakers:print(f" {speaker.id}: {speaker.sentiment.label} ({speaker.sentiment.score:.2f})")for point in result.trajectory:print(f" [{point.time}s] {point.label}: {point.score:.2f}")
Use Cases
Supported Input Formats
Quick Info
Try This Conversion
Get started with the Mixpeek API and convert your first file in minutes.
Frequently Asked Questions
Related Converters
Audio to Text
Transcribe audio files into text with high accuracy. Supports speaker diarization, punctuation restoration, timestamps, and over 50 languages. Handles podcasts, calls, meetings, and broadcast audio.
Audio to Summary
Generate concise summaries from audio recordings by transcribing speech and synthesizing key points. Supports meeting minutes, podcast summaries, and interview highlights with configurable length and format.
Video to Metadata
Extract comprehensive technical and semantic metadata from video files. Returns codec details, resolution, duration, frame rate, and AI-generated semantic tags including detected objects, scenes, dominant colors, and content categories.
Audio to Keywords
Extract semantically relevant keywords and key phrases from audio recordings. Transcribes speech, identifies salient terms using NLP, and ranks them by relevance and frequency. Ideal for content tagging, topic detection, and search optimization.
Ready to convert audio to sentiment?
Start using the Mixpeek Audio to Sentiment in minutes. Sign up for a free API key and follow the documentation to get started.
