Concepts

Hierarchical Classification

Assign content to multi-level category hierarchies using embedding-based classification. Define your taxonomy once, then classify new content automatically with confidence scores.

video

image

text

Multi-Tier

76.0K runs

Run in Builder

"Show all educational tutorial videos classified under safe content with high confidence"

Why This Matters

Taxonomies are organizational infrastructure. Once defined, they enable consistent classification, compliance tagging, and structured navigation across all content.

import requests

API_URL = "https://api.mixpeek.com"
headers = {"Authorization": "Bearer YOUR_API_KEY", "X-Namespace": "your-namespace"}

# Create hierarchical taxonomy
taxonomy = requests.post(f"{API_URL}/v1/taxonomies", headers=headers, json={
    "taxonomy_name": "content_classification",
    "taxonomy_type": "hierarchical",
    "retriever_id": "ret_classifier",
    "input_mappings": {
        "query_embedding": "mixpeek://multimodal_extractor@v1/embedding"
    },
    "hierarchy": [
        {
            "node_id": "safe",
            "collection_id": "col_safe_examples",
            "enrichment_fields": ["metadata.category"]
        },
        {
            "node_id": "educational",
            "parent_node_id": "safe",
            "collection_id": "col_educational_examples",
            "enrichment_fields": ["metadata.topic"]
        }
    ]
}).json()

# Apply taxonomy to collection
requests.post(
    f"{API_URL}/v1/collections/col_my_content/apply-taxonomy",
    headers=headers,
    json={"taxonomy_id": taxonomy["taxonomy_id"]}
)

# Search within taxonomy categories
results = requests.post(
    f"{API_URL}/v1/retrievers/taxonomy-search/execute",
    headers=headers,
    json={"query": {"text": "educational tutorial videos"}}
).json()

for doc in results["documents"]:
    print(f"Document: {doc['document_id']}")
    print(f"  Category: {doc.get('taxonomy_path', 'N/A')}")

Feature Extractors

Image Embedding

Generate visual embeddings for similarity search and clustering

752K runs

Text Embedding

Extract semantic embeddings from documents, transcripts and text content

827K runs

Video Embedding

Generate vector embeddings for video content

610K runs

Retriever Stages

attribute filter

Filter documents by metadata attribute values using boolean logic

filter

Resources Used

Taxonomy

Content Taxonomy

Multi-level classification hierarchy with confidence thresholds

Documentation

Taxonomies

Use Cases Using This Recipe

Intermediate

Coming Soon

8 min

Product Affordance Intelligence

Understand what products can do, not just what they look like

+35% improvement

Search relevance (NDCG)

ecommerce

Who It's For

E-commerce platforms, product catalog managers, and merchandising teams managing 100K+ SKU catalogs

View Details

Advanced

Coming Soon

9 min

AdTech Creative Intelligence

Understand what makes ad creatives perform before they run

99% faster

Creative approval speed

advertising

Who It's For

Ad networks, DSPs, creative agencies, and brand marketing teams managing 10K+ creative assets monthly

View Details

Advanced

Coming Soon

7 min

Government Intelligence

Multimodal search and analysis for government document repositories

100% unified index

Cross-department search coverage

legal

Who It's For

Government agencies, policy researchers, compliance teams, and public affairs professionals managing multi-department document repositories

View Details

Beginner

Coming Soon

7 min

Asset Intelligence (DAM Auto-Labeling)

Auto-tag and organize digital assets with multimodal AI

95% reduction

Manual tagging effort

dam intelligence

advertising

entertainment

Who It's For

Creative teams, brand managers, and media companies managing 100K+ digital assets across DAM platforms

View Details

Intermediate

AI Content Moderation for User-Generated Content

Automatically detect and flag policy-violating content across text, images, and video

95%+ of violations flagged before going live

Pre-publication violation catch rate

media

Who It's For

UGC platforms, social media companies, marketplace operators, and community platforms processing 100K+ daily uploads requiring trust and safety review

View Details

Beginner

AI-Powered Digital Asset Management

Search, organize, and enrich your media library with multimodal AI

80% faster search-to-find

Asset discovery time

media

Who It's For

Media companies, creative agencies, brand teams, and publishers managing libraries of 500K+ images, videos, and documents across production workflows

View Details

Intermediate

Automated Video Tagging for Streaming

Auto-generate rich metadata for every scene, shot, and moment in your catalog

10x more tags than manual editorial process

Metadata tags per title

entertainment

Who It's For

Streaming platforms, content distributors, and VOD services managing catalogs of 10K+ titles that need rich metadata for discovery and recommendation

View Details

Intermediate

9 min

Visual Product Search for Ecommerce

Let shoppers search your catalog with images instead of keywords

2.3x increase for visual search users

Search-to-purchase conversion

ecommerce

Who It's For

Ecommerce platforms, online marketplaces, fashion retailers, home goods stores, and any product catalog with 10K+ SKUs where visual discovery drives conversion

View Details

Intermediate

10 min

Brand Safety Verification

AI-powered brand safety scoring for ad placements and content partnerships

95% reduction in unsafe ad adjacency

Brand safety violation rate

advertising

Who It's For

Brand safety teams at agencies, DSPs, SSPs, ad networks, and brand marketers who need to verify that ad placements and content partnerships meet safety standards before spend is allocated

View Details

Advanced

9 min

AI Compliance Document Review

Automate regulatory document review with multimodal AI understanding

10x faster

Review cycle time

legal

finance

Who It's For

Compliance teams, regulatory affairs departments, and legal operations groups reviewing 1,000+ regulatory documents per quarter across banking, insurance, pharma, and financial services

View Details

Advanced

12 min

Clinical NLP at Scale

Extract structured intelligence from clinical notes, pathology reports, and medical records

94% F1 on medical NER benchmarks

Entity extraction accuracy

healthcare

Who It's For

Healthcare IT teams, clinical informatics departments, and health systems processing thousands of clinical documents daily

View Details

Hierarchical Classification

Why This Matters

Feature Extractors

Retriever Stages

Resources Used

Documentation

Use Cases Using This Recipe

Product Affordance Intelligence

AdTech Creative Intelligence

Government Intelligence

Asset Intelligence (DAM Auto-Labeling)

AI Content Moderation for User-Generated Content

AI-Powered Digital Asset Management

Automated Video Tagging for Streaming

Visual Product Search for Ecommerce

Brand Safety Verification

AI Compliance Document Review

Clinical NLP at Scale

Related Recipes & Resources

Video Embedding

Text Embedding

Image Embedding

Video Embedding

Image Embedding

Brand Safety & Ad Verification Pipeline