On-screen text with narration and background music
Detect scenes where on-screen text, human narration, and background music occur simultaneously using OCR, speech-to-text, and audio classification.
video
audio
text
4.6K runs
"Text appearing on screen while a person is narrating and background music plays"
from mixpeek import Mixpeek

client = Mixpeek("your-api-key")

# Create a collection for identifying scenes with on-screen text, narration, and music
client.collections.create(
    feature_extractors=[
        "image-text-extraction",
        "video-transcription",
        "audio-classification",
    ]
)
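Once the three extractors have produced time-stamped detections, finding scenes where text, narration, and music occur simultaneously comes down to intersecting time intervals. The following is a minimal sketch of that idea in plain Python. It does not call the Mixpeek API: the `(start, end)` tuple format, the function names, and the sample spans are all assumptions made for illustration, and each input list is assumed to be sorted and non-overlapping.

```python
from typing import List, Tuple

# Hypothetical representation of a detection: (start_seconds, end_seconds).
Interval = Tuple[float, float]


def intersect(a: List[Interval], b: List[Interval]) -> List[Interval]:
    """Return the overlaps between two sorted, non-overlapping interval lists."""
    out: List[Interval] = []
    i = j = 0
    while i < len(a) and j < len(b):
        start = max(a[i][0], b[j][0])
        end = min(a[i][1], b[j][1])
        if start < end:
            out.append((start, end))
        # Advance whichever interval finishes first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return out


def co_occurring(
    text: List[Interval], speech: List[Interval], music: List[Interval]
) -> List[Interval]:
    """Spans where on-screen text, narration, and music are all present."""
    return intersect(intersect(text, speech), music)


# Made-up detections standing in for OCR, transcription, and audio classification output.
text_spans = [(0.0, 12.0), (30.0, 45.0)]
speech_spans = [(5.0, 20.0), (28.0, 40.0)]
music_spans = [(0.0, 35.0)]

scenes = co_occurring(text_spans, speech_spans, music_spans)
print(scenes)  # [(5.0, 12.0), (30.0, 35.0)]
```

A two-pointer sweep keeps each intersection linear in the number of detections, which matters for long videos with many short OCR or speech segments.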
Feature Extractors
Image Text Extraction (OCR)
Extract text content from images and scanned documents
725K runs
Video Transcription
Convert speech to text with timestamps for video content
385K runs
Audio Classification
Classify audio content into categories like music, speech, noise, etc.
0K runs
Retrievers
Hybrid KNN Search
Combines vector similarity with keyword matching for improved retrieval accuracy
325K runs
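To make the idea of combining vector similarity with keyword matching concrete, here is a minimal sketch of one common approach: a weighted sum of cosine similarity and a keyword-overlap score. This is not Mixpeek's implementation. The `hybrid_search` function, the `alpha` weight, the document dictionary layout, and the sample data are all hypothetical.

```python
import math
from typing import Any, Dict, List, Sequence, Tuple


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def keyword_score(query: str, text: str) -> float:
    """Fraction of distinct query words that also appear in the text."""
    query_words = set(query.lower().split())
    text_words = set(text.lower().split())
    return len(query_words & text_words) / len(query_words) if query_words else 0.0


def hybrid_search(
    query_vec: Sequence[float],
    query_text: str,
    docs: List[Dict[str, Any]],
    alpha: float = 0.5,  # assumed weight: 1.0 = vector only, 0.0 = keyword only
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Rank documents by a weighted sum of vector and keyword scores."""
    scored = [
        (
            doc["id"],
            alpha * cosine(query_vec, doc["vector"])
            + (1 - alpha) * keyword_score(query_text, doc["text"]),
        )
        for doc in docs
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Made-up scene records with toy 2-D embeddings.
docs = [
    {"id": "scene-1", "vector": [1.0, 0.0], "text": "narrator explains pricing over music"},
    {"id": "scene-2", "vector": [0.0, 1.0], "text": "silent product shot"},
    {"id": "scene-3", "vector": [0.6, 0.8], "text": "pricing table on screen"},
]

results = hybrid_search([1.0, 0.0], "pricing music", docs, alpha=0.5, k=2)
print([doc_id for doc_id, _ in results])  # ['scene-1', 'scene-3']
```

Production systems typically replace the overlap score with BM25 and the linear scan with an approximate nearest-neighbor index, but the fusion step follows the same pattern.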