    On-screen text with narration and background music

    Detect scenes where on-screen text, human narration, and background music occur simultaneously using OCR, speech-to-text, and audio classification.

    video
    audio
    text
    4.6K runs

    "Text appearing on screen while a narrator speaks and background music plays"

    from mixpeek import Mixpeek

    client = Mixpeek("your-api-key")

    # Create a collection for identifying scenes with on-screen text, narration, and music
    client.collections.create(
        feature_extractors=[
            "image-text-extraction",
            "video-transcription",
            "audio-classification",
        ]
    )
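
The recipe looks for moments where all three signals are present at once. As a rough illustration of that idea (not Mixpeek's implementation), the sketch below intersects three lists of time intervals. It assumes each extractor's output has already been reduced to sorted, non-overlapping `(start, end)` spans in seconds. The function names and sample spans are hypothetical.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start_seconds, end_seconds)


def intersect(a: List[Interval], b: List[Interval]) -> List[Interval]:
    """Return the overlapping portions of two sorted, non-overlapping interval lists."""
    out: List[Interval] = []
    i = j = 0
    while i < len(a) and j < len(b):
        start = max(a[i][0], b[j][0])
        end = min(a[i][1], b[j][1])
        if start < end:
            out.append((start, end))
        # Advance whichever interval finishes first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return out


def co_occurring(text: List[Interval], speech: List[Interval],
                 music: List[Interval]) -> List[Interval]:
    """Spans where on-screen text, narration, and music are all present."""
    return intersect(intersect(text, speech), music)


# Hypothetical spans, standing in for OCR, transcription, and audio classification results.
text_spans = [(0.0, 5.0), (12.0, 20.0)]
speech_spans = [(3.0, 15.0)]
music_spans = [(4.0, 13.0), (18.0, 25.0)]

scenes = co_occurring(text_spans, speech_spans, music_spans)
print(scenes)  # [(4.0, 5.0), (12.0, 13.0)]
```

Intersecting pairwise keeps the logic linear in the number of spans and extends naturally if a fourth signal is added later.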

    Feature Extractors

    Image Text Extraction (OCR)

    Extract text content from images and scanned documents

    725K runs

    Video Transcription

    Convert speech to text with timestamps for video content

    385K runs

    Audio Classification

    Classify audio content into categories such as music, speech, and noise

    0K runs

    Retrievers

    Hybrid KNN Search

    Combines vector similarity with keyword matching for improved retrieval accuracy

    325K runs
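
To make the idea of combining vector similarity with keyword matching concrete, here is a minimal, self-contained sketch of one common approach: a weighted sum of cosine similarity and keyword overlap. This is an assumption for illustration only and does not describe how the Hybrid KNN Search retriever is built. The `alpha` weight, the function names, and the toy documents are all hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def keyword_score(query: str, text: str) -> float:
    """Fraction of query terms that appear in the document text."""
    q_terms = set(query.lower().split())
    t_terms = set(text.lower().split())
    return len(q_terms & t_terms) / len(q_terms) if q_terms else 0.0


def hybrid_search(query_text: str, query_vec: Sequence[float],
                  docs: List[Dict], alpha: float = 0.5, k: int = 2) -> List[str]:
    """Rank docs by alpha * vector similarity + (1 - alpha) * keyword overlap."""
    scored = []
    for doc in docs:
        score = (alpha * cosine(query_vec, doc["vector"])
                 + (1 - alpha) * keyword_score(query_text, doc["text"]))
        scored.append((score, doc["id"]))
    scored.sort(reverse=True)  # highest combined score first
    return [doc_id for _, doc_id in scored[:k]]


# Toy documents with made-up 2-dimensional embeddings.
docs = [
    {"id": "a", "vector": [1.0, 0.0], "text": "narrator speaks over music"},
    {"id": "b", "vector": [0.6, 0.8], "text": "silent title card"},
    {"id": "c", "vector": [0.0, 1.0], "text": "music video with narrator"},
]

results = hybrid_search("narrator music", [1.0, 0.0], docs)
print(results)  # ['a', 'c']
```

Document `c` outranks `b` even though its embedding is less similar to the query, because both query terms appear in its text. That is the kind of case where the keyword component is meant to help.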