Video Learning Hub
Master multimodal AI concepts through comprehensive tutorials, guides, and best practices from our expert team.
Trusted by engineers at
Introducing Batch Retrieval — 50 Queries, One API Call, Real-Time SSE Streaming
Run up to 50 multimodal search queries in a single API call with server-side parallelism and real-time Server-Sent Events streaming. This launch video walks through the batch retriever execution endpoint using a brand protection use case — scanning marketplace listings for counterfeits. What you'll learn: ⚡ Batch execution endpoint: POST /v1/retrievers/{id}/execute/batch ⚡ Server-side concurrency (up to 20 parallel workers) ⚡ Real-time SSE streaming as each query completes ⚡ Per-query error isolation ⚡ Use cases: brand protection, content moderation, IP clearance ⚡ 5× faster than sequential execution

Mux + Mixpeek — Find Any Moment in Your Video Library in Seconds
See how the Mux + Mixpeek integration turns your entire video library into a searchable, intelligent asset. This walkthrough covers connecting Mux to Mixpeek via selective sync, running multimodal extraction on every video, and searching across scenes, faces, objects, spoken words, and on-screen text from a single query. What you'll learn: ⚡ Connecting Mux as a data source with selective sync ⚡ Metadata filters and passthrough flags for cost-efficient indexing ⚡ Multimodal decomposition: visual embeddings, face identity, OCR, transcription ⚡ Building a visual search retriever over your Mux library ⚡ Real-time webhook updates and automatic reconciliation

Stop Paying S3 Prices: Build a Video AI Pipeline with Backblaze + Mixpeek
Learn how to build a cost-effective video AI pipeline by replacing S3 with Backblaze B2 as your object storage backend. This walkthrough covers connecting Backblaze to Mixpeek, ingesting video content, and running multimodal feature extraction at a fraction of the cost. What you'll learn: ⚡ Setting up Backblaze B2 as a Mixpeek data source ⚡ Ingesting video files from B2 buckets ⚡ Running multimodal AI pipelines on stored content ⚡ Cost comparison vs AWS S3 ⚡ End-to-end pipeline from storage to searchable AI features

IP Safety Scanner: Pre-Publication Copyright Detection with Mixpeek
A walkthrough of the IP safety scanner showing how to detect celebrity likenesses, brand logos, and copyrighted audio in video and image content before publication. What you'll learn: ⚡ How pre-publication IP clearance works ⚡ Face detection against custom reference corpora ⚡ Logo and trademark recognition in video frames ⚡ Audio fingerprinting for copyrighted music ⚡ Setting confidence thresholds for automated clearance

Turn Web Scraping into Structured AI Data (Bright Data + Mixpeek Walkthrough)
A hands-on walkthrough showing how to combine Bright Data's web scraping infrastructure with Mixpeek to transform raw web content into structured, searchable AI data. What you'll learn: ⚡ Connecting Bright Data as a data source in Mixpeek ⚡ Scraping and ingesting web content at scale ⚡ Extracting structured context with multimodal AI ⚡ Making scraped data searchable via retrievers ⚡ End-to-end pipeline from raw web data to AI-ready output

Building an Exploratory Multimodal Retriever with the National Gallery
Discover how to build a powerful exploratory image board using multimodal search across 120,000 images from the National Gallery. This walkthrough demonstrates combining text search, reverse image search, and document-based queries into a unified retrieval experience using hybrid search with Reciprocal Rank Fusion (RRF). 👉 Live Demo: https://mxp.co/r/npg What you'll learn: ⚡ Building exploratory search interfaces for visual content ⚡ Combining text, image, and document reference queries ⚡ Implementing hybrid search with RRF for optimal results ⚡ Using Google SigLIP embeddings for image understanding ⚡ Creating multi-stage retriever pipelines with feature search ⚡ Capturing user signals for recommendation systems ⚡ Architecture patterns: Objects → Buckets → Collections → Retrievers Real-world demo: Visual curation across 120k images, 12GB of data, with text + image + document hybrid queries. Full source code available in the Mixpeek showcase repository.

Web Scraper Guide
Learn how to use Mixpeek's Web Scraper to recursively crawl websites and extract multimodal content with automatic embeddings. This guide demonstrates crawling documentation sites, extracting code snippets and images, and making everything searchable with semantic embeddings. What you'll learn: ⚡ Recursive website crawling with depth control ⚡ Extracting text, code blocks, and images ⚡ Multimodal embeddings (E5-Large, Jina Code, SigLIP) ⚡ JavaScript rendering for SPAs ⚡ URL filtering and structured extraction ⚡ Building searchable knowledge bases from docs

Buckets Guide
Learn how to use Mixpeek Buckets for schema-backed data ingestion with automatic validation and lineage tracking. This guide demonstrates creating buckets, defining schemas, uploading objects with multimodal blobs, and processing them through collections. What you'll learn: ⚡ Creating buckets with JSON schema validation ⚡ Uploading objects with multimodal blobs (text, image, video, JSON) ⚡ Schema enforcement and blob type validation ⚡ Lineage tracking from source to documents ⚡ Integration with collections for feature extraction ⚡ Best practices for organizing multimodal data

Video Understanding: From Frames to Contextual Search
Master video understanding and how it differs from basic image understanding. This video covers frame extraction techniques (sampling, keyframe detection, scene-based), video embedding models that capture temporal context, and building sophisticated semantic video search applications. What you'll learn: ⚡ Video vs image understanding: temporal context matters ⚡ Frame extraction techniques: sampling, keyframe, scene-based ⚡ Frame-level vs video-level embeddings ⚡ How video embeddings capture motion and actions ⚡ Scene detection with AutoShot and semantic deduplication ⚡ Vertex AI multimodal embeddings for video ⚡ Building scene-based video search pipelines ⚡ Real demo: Contextual video retrieval in Mixpeek Studio