Mixpeek Logo
    Schedule Demo

    Feature Extractors

    Configurable ETL pipelines that extract structured data from multimodal content, specific to your use case. They are then paired with retrievers to create multimodal search pipelines.

    Learn more in docs

    Activity Grouping

    Detect, categorize, and group activities in video content

    420K runs
    Popular

    Face Grouping

    Detect, track, and group faces across video frames

    10K runs
    Popular

    Facial Recognition

    Detect and identify faces in images with high accuracy

    650K runs
    Popular

    Late Interaction Ranker

    Ranks a list of documents against a query using late interaction models (e.g., ColBERT). Produces relevance scores.

    0K runs
    Popular

    Object Detection

    Identify and locate objects within images with bounding boxes

    631K runs
    Popular

    Object Grouping

    Segment and group objects across video frames

    0K runs
    Popular

    PII Redactor

    Detect and redact personally identifiable information from text, transcripts, and OCR output

    183K runs
    Popular

    Seamless Expressive Translation

    Translate speech across languages while preserving emotional tone, pauses, and vocal style

    284K runs
    Popular

    Video Embedding

    Generate vector embeddings for video content

    610K runs
    Popular

    XceptionNet Deepfake Detector

    Detects manipulated facial regions using a CNN trained on the FaceForensics++ dataset.

    340K runs
    Popular

    Accent & Dialect Identification

    Identify accents and regional speech patterns

    310K runs

    Acoustic Scene Classification

    Identify the environment where audio was recorded

    340K runs
    Page 1 of 7