Automatically enrich your data with extracted metadata: entities, topics, sentiment, language, and custom attributes. Transform raw content into structured, queryable data.

text

image

video

audio

Single Tier

4.7K runs

Run in Builder

from mixpeek import Mixpeek

client = Mixpeek(api_key="YOUR_API_KEY")

namespace = client.namespaces.create(name="enriched-data")
collection = client.collections.create(
    namespace_id=namespace.id,
    name="customer-feedback",
    extractors=[
        "entity-extraction",
        "topic-classification",
        "sentiment-analysis",
        "language-detection"
    ]
)

# Upload content - metadata extracted automatically
client.buckets.upload(
    collection_id=collection.id,
    url="s3://your-bucket/feedback/"
)

# Query enriched data
positive_feedback = client.documents.search(
    namespace_id=namespace.id,
    filters={
        "sentiment": "positive",
        "topic": "product-quality"
    }
)

Feature Extractors

Retriever Stages

Related Recipes & Resources

Explore these related resources to deepen your understanding and discover more powerful features

Glossary

Sentiment Analysis

Detecting emotional tone and opinion in text

Learn more

Recipe

Multimodal RAG Pipeline

Build a retrieval-augmented generation system that works with text, images, and video. Feed relevant multimodal context to LLMs for grounded responses.

Learn more

Recipe

Taxonomy Enrichment Pipeline

Automatically classify and tag content using custom taxonomies. Map your content to IAB categories, custom hierarchies, or industry-specific classifications.

Learn more

Recipe

Content Clustering Pipeline

Automatically group similar content together using embedding-based clustering. Discover themes, identify duplicates, and organize large content libraries.

Learn more

Recipe

Multimodal Knowledge Base

Consolidate documents, videos, images, and audio into a single searchable knowledge base with RAG capabilities. Supports natural language Q&A across all content types, with citations linking back to the exact source document, video timestamp, or image.

Learn more

Recipe

Feature Extraction

Multi-tier feature extraction that decomposes content into searchable components: embeddings, transcripts, detected objects, OCR text, scene boundaries, and more. The foundation for all downstream retrieval and analysis.

Learn more