Mixpeek Logo
    Concepts

    Hierarchical Classification

    Assign content to multi-level category hierarchies using embedding-based classification. Define your taxonomy once, then classify new content automatically with confidence scores.

    video
    image
    text
    Multi-Tier
    76.0K runs
    Deploy Recipe

    "Show all educational tutorial videos classified under safe content with high confidence"

    Why This Matters

    Taxonomies are organizational infrastructure. Once defined, they enable consistent classification, compliance tagging, and structured navigation across all content.

    import requests
    API_URL = "https://api.mixpeek.com"
    headers = {"Authorization": "Bearer YOUR_API_KEY", "X-Namespace": "your-namespace"}
    # Create hierarchical taxonomy
    taxonomy = requests.post(f"{API_URL}/v1/taxonomies", headers=headers, json={
    "taxonomy_name": "content_classification",
    "taxonomy_type": "hierarchical",
    "retriever_id": "ret_classifier",
    "input_mappings": {
    "query_embedding": "mixpeek://multimodal_extractor@v1/embedding"
    },
    "hierarchy": [
    {
    "node_id": "safe",
    "collection_id": "col_safe_examples",
    "enrichment_fields": ["metadata.category"]
    },
    {
    "node_id": "educational",
    "parent_node_id": "safe",
    "collection_id": "col_educational_examples",
    "enrichment_fields": ["metadata.topic"]
    }
    ]
    }).json()
    # Apply taxonomy to collection
    requests.post(
    f"{API_URL}/v1/collections/col_my_content/apply-taxonomy",
    headers=headers,
    json={"taxonomy_id": taxonomy["taxonomy_id"]}
    )
    # Search within taxonomy categories
    results = requests.post(
    f"{API_URL}/v1/retrievers/taxonomy-search/execute",
    headers=headers,
    json={"query": {"text": "educational tutorial videos"}}
    ).json()
    for doc in results["documents"]:
    print(f"Document: {doc['document_id']}")
    print(f" Category: {doc.get('taxonomy_path', 'N/A')}")

    Feature Extractors

    Image Embedding

    Generate visual embeddings for similarity search and clustering

    752K runs

    Text Embedding

    Extract semantic embeddings from documents, transcripts and text content

    827K runs

    Video Embedding

    Generate vector embeddings for video content

    610K runs

    Retriever Stages

    attribute filter

    Filter documents by metadata attribute values using boolean logic

    filter

    Documentation

    Use Cases Using This Recipe

    Intermediate
    Coming Soon
    8 min

    Product Affordance Intelligence

    Understand what products can do, not just what they look like

    +35% improvement

    Search relevance (NDCG)

    Who It's For

    E-commerce platforms, product catalog managers, and merchandising teams managing 100K+ SKU catalogs

    Advanced
    Coming Soon
    9 min

    AdTech Creative Intelligence

    Understand what makes ad creatives perform before they run

    99% faster

    Creative approval speed

    Who It's For

    Ad networks, DSPs, creative agencies, and brand marketing teams managing 10K+ creative assets monthly

    Advanced
    Coming Soon
    7 min

    Government Intelligence

    Multimodal search and analysis for government document repositories

    100% unified index

    Cross-department search coverage

    Who It's For

    Government agencies, policy researchers, compliance teams, and public affairs professionals managing multi-department document repositories

    Beginner
    Coming Soon
    7 min

    Asset Intelligence (DAM Auto-Labeling)

    Auto-tag and organize digital assets with multimodal AI

    95% reduction

    Manual tagging effort

    Who It's For

    Creative teams, brand managers, and media companies managing 100K+ digital assets across DAM platforms

    Intermediate

    AI Content Moderation for User-Generated Content

    Automatically detect and flag policy-violating content across text, images, and video

    95%+ of violations flagged before going live

    Pre-publication violation catch rate

    Who It's For

    UGC platforms, social media companies, marketplace operators, and community platforms processing 100K+ daily uploads requiring trust and safety review

    Beginner

    AI-Powered Digital Asset Management

    Search, organize, and enrich your media library with multimodal AI

    80% faster search-to-find

    Asset discovery time

    Who It's For

    Media companies, creative agencies, brand teams, and publishers managing libraries of 500K+ images, videos, and documents across production workflows

    Intermediate

    Automated Video Tagging for Streaming

    Auto-generate rich metadata for every scene, shot, and moment in your catalog

    10x more tags than manual editorial process

    Metadata tags per title

    Who It's For

    Streaming platforms, content distributors, and VOD services managing catalogs of 10K+ titles that need rich metadata for discovery and recommendation