Frames with multiple people arguing and high visual activity
Isolate heated moments by identifying multiple speakers arguing through speaker diarization and audio event detection, combined with high visual activity from action recognition.
video
audio
6.8K runs
Deploy Recipe"Frames with multiple people arguing while visual activity spikes"
from mixpeek import Mixpeekclient = Mixpeek("your-api-key")# Create a collection for detecting arguments during high-activity scenesclient.collections.create(feature_extractors=["speaker-diarization","audio-event-detection","action-recognition"])
Feature Extractors
Speaker Diarization
Identify and separate different speakers in audio
320K runs
Audio Event Detection
Detect specific audio events like gunshots, glass breaking, alarms, etc.
0K runs
Action Recognition
Identify and classify human actions in video
412K runs
Retrievers
Hybrid KNN Search
Combines vector similarity with keyword matching for improved retrieval accuracy
325K runs