A former lead of MongoDB's Search Team, Ethan noticed that the most common problem customers faced was building indexing and search infrastructure on top of their S3 buckets. Mixpeek was born.
A 3072-dimensional embedding encodes everything about a video and distinguishes nothing. Decomposing content into named, measurable features, then placing them in a queryable hierarchy, is how multimodal search actually works at scale.
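To make "named, measurable features in a queryable hierarchy" concrete, here is a minimal sketch of what that decomposition could look like. The `Feature` and `ContentNode` classes, their fields, and the asset/scene/frame levels are illustrative assumptions, not Mixpeek's actual schema.

```python
from dataclasses import dataclass, field

# Illustrative sketch only; not Mixpeek's data model.
@dataclass
class Feature:
    """A named, measurable signal extracted from one piece of content."""
    name: str          # e.g. "face_embedding", "transcript", "logo"
    modality: str      # "video", "audio", "image", "text"
    value: object      # scalar, string, or embedding vector
    confidence: float

@dataclass
class ContentNode:
    """One level of the queryable hierarchy: asset -> scene -> frame."""
    id: str
    features: list[Feature] = field(default_factory=list)
    children: list["ContentNode"] = field(default_factory=list)

    def query(self, feature_name: str, min_conf: float = 0.5):
        """Walk the hierarchy and return every matching feature."""
        hits = [f for f in self.features
                if f.name == feature_name and f.confidence >= min_conf]
        for child in self.children:
            hits.extend(child.query(feature_name, min_conf))
        return hits
```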
Text-only RAG pipelines miss 80% of what is in your content. A video contains faces, dialogue, on-screen text, background music, and brand logos. No single embedding captures all of that. The solution is multi-stage retrieval.
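As a rough sketch of what multi-stage retrieval means in practice: a cheap per-modality recall pass, then a fusion step across modalities. The function names and the reciprocal-rank-fusion constant below are assumptions for illustration, not Mixpeek's pipeline.

```python
import numpy as np

# Sketch only; not Mixpeek's retrieval code.
def recall(query_vec, doc_vecs, k=100):
    """Stage 1: coarse cosine-similarity recall within a single modality."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    scores = d @ q
    top = np.argsort(-scores)[:k]
    return [(int(i), float(scores[i])) for i in top]

def fuse(result_lists, k=10):
    """Stage 2: reciprocal-rank fusion across modalities (speech, OCR, faces, ...)."""
    fused = {}
    for results in result_lists:
        for rank, (doc_id, _) in enumerate(results):
            fused[doc_id] = fused.get(doc_id, 0.0) + 1.0 / (60 + rank)
    return sorted(fused, key=fused.get, reverse=True)[:k]
```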
Traditional taxonomies classify one content type at a time. Multimodal taxonomies unify classification across every format using embedding similarity: the missing layer between raw AI features and structured, searchable metadata.
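One way to read "classification via embedding similarity" is nearest-label assignment in a shared embedding space. A minimal sketch, assuming content and taxonomy node labels are embedded with the same multimodal model; the function and threshold are hypothetical.

```python
import numpy as np

# Illustrative only; assumes content and taxonomy labels share an embedding space.
def classify(content_vec, node_vecs, node_labels, threshold=0.3):
    """Assign content to the taxonomy node whose label embedding is most similar."""
    c = content_vec / np.linalg.norm(content_vec)
    n = node_vecs / np.linalg.norm(node_vecs, axis=1, keepdims=True)
    sims = n @ c
    best = int(np.argmax(sims))
    if sims[best] < threshold:
        return None, float(sims[best])   # no confident match in the taxonomy
    return node_labels[best], float(sims[best])
```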
We compared 21 S3-compatible object storage providers across pricing, egress, features, and fine print. AWS S3 costs 15x more than the cheapest alternative for the same workload. Here's everything we found.
How we built an autonomous Kalshi trading bot using the Kalshi API and Mixpeek's video transcription, semantic search, and LLM data extraction, with no external tools required.
We are drowning in unstructured data — video, audio, images, documents, IoT — but our infrastructure still assumes everything is a row or a vector. The multimodal data warehouse is the missing layer: object decomposition, tiered storage, and multi-stage retrieval pipelines for the AI era.
We benchmarked every viable approach to multimodal document retrieval on financial tables (ViDoRe/TabFQuAD) and found a combination that hasn't been published before: ColQwen2 + MUVERA. It retains 99.4% of brute-force quality at a fraction of the cost, and obliterates OCR-based search. The problem: late interaction models like ColBERT and ColPali represent documents as sets of vectors, one per token or image patch; at query time, every query token finds its best-matching document token (MaxSim), and those per-token maxima are summed into the document's score.
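The MaxSim operator referenced above is simple to write down. The sketch below scores one document against one query and assumes both sides are already embedded as token/patch vectors by a late-interaction model such as ColQwen2 or ColPali.

```python
import numpy as np

def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction (MaxSim) relevance: each query token takes its best
    cosine similarity against any document token/patch; the maxima are summed."""
    q = query_vecs / np.linalg.norm(query_vecs, axis=1, keepdims=True)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    sim = q @ d.T                      # (num_query_tokens, num_doc_tokens)
    return float(sim.max(axis=1).sum())
```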
Every major IP enforcement tool finds violations after they're live. We built one that catches them before publication. Here's the architecture, the models, and what we learned.
We tested Gemini, Twelve Labs Marengo, X-CLIP, SigLIP 2, and InternVideo2 on text-to-video retrieval with graded relevance. The results surprised us.
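"Graded relevance" implies a rank-aware metric such as nDCG rather than plain recall. A minimal nDCG@k helper, assuming integer relevance grades and that the ranked list contains all judged items for the query; this is a generic metric sketch, not the benchmark's exact scoring code.

```python
import numpy as np

def ndcg_at_k(relevances, k=10):
    """nDCG@k for one query: 'relevances' are graded labels of the retrieved
    items in ranked order (e.g. 0 = irrelevant, 2 = highly relevant)."""
    rel = np.asarray(relevances, dtype=float)[:k]
    dcg = float(((2 ** rel - 1) / np.log2(np.arange(2, rel.size + 2))).sum())
    ideal = np.sort(np.asarray(relevances, dtype=float))[::-1][:k]
    idcg = float(((2 ** ideal - 1) / np.log2(np.arange(2, ideal.size + 2))).sum())
    return dcg / idcg if idcg > 0 else 0.0
```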
Google's Gemini Embedding 2 embeds images, PDFs, and text together in a single API call. Here's how we integrated it into Mixpeek's feature extractor pipeline, the production numbers, and where multi-file embedding beats single-chunk approaches.
How we built query preprocessing into Mixpeek's feature_search stage — decompose a 500MB video into chunks, embed in parallel, fuse results. Zero API surface change for callers.
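A hedged sketch of the decompose, embed-in-parallel, fuse pattern described above; `embed_fn` and `search_fn` are placeholders for whatever model and index sit behind `feature_search`, not the real internals.

```python
from concurrent.futures import ThreadPoolExecutor

# Sketch of the pattern only; embed_fn / search_fn are hypothetical callables.
def preprocess_and_search(video_segments, embed_fn, search_fn, top_k=10):
    """Decompose -> embed each chunk in parallel -> fuse per-chunk results."""
    with ThreadPoolExecutor(max_workers=8) as pool:
        segment_vecs = list(pool.map(embed_fn, video_segments))
    best = {}
    for vec in segment_vecs:
        for doc_id, score in search_fn(vec, top_k):
            best[doc_id] = max(best.get(doc_id, 0.0), score)
    return sorted(best.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
```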
Sports broadcasters cut 4-8 hour editing sessions to 15 minutes using AI video analysis. Learn how to build automated highlight detection, archive search, and performance analytics pipelines for any sport.
We run 20+ ML models in parallel across video, image, and document pipelines. Here's the Ray architecture behind it: custom resource isolation, flexible actor pools, distributed Qdrant writes, and the lessons we learned the hard way.
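For readers unfamiliar with the pattern, a minimal Ray actor-pool sketch: long-lived actors with explicit resource requests, fed through `ray.util.ActorPool`. The model name, pool size, and resource numbers are placeholders, not Mixpeek's configuration.

```python
import ray
from ray.util import ActorPool

ray.init()

@ray.remote(num_cpus=1)          # resource isolation: one CPU reserved per replica
class Extractor:
    def __init__(self, model_name):
        self.model_name = model_name   # in practice, load the model once per actor

    def process(self, item):
        # run the model on one video chunk / image / document page
        return {"item": item, "model": self.model_name}

# A small pool of long-lived actors; work items are streamed through it.
pool = ActorPool([Extractor.remote("clip-vit") for _ in range(4)])
results = list(pool.map(lambda actor, item: actor.process.remote(item), range(100)))
```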
6,000+ ZIP codes straddle congressional district lines. At ZIP+4 precision, federal, state, and local disclaimer requirements can all apply simultaneously. Here's how multimodal AI solves what static rules engines can't.
Classify text, images, and video into 700+ IAB Content Taxonomy categories using multimodal AI. Learn how it works under the hood and how to extend it for your contextual targeting needs.