Multimodal Extractor v2 with Gemini Embedding 2
New multimodal extractor generates 3072-dimensional embeddings using Gemini Embedding 2, enabling richer cross-modal search across text, images, and video.
Changelog
New capabilities and performance improvements across the Mixpeek platform. Every entry links to relevant documentation and examples.
New multimodal extractor generates 3072-dimensional embeddings using Gemini Embedding 2, enabling richer cross-modal search across text, images, and video.
Publish, discover, and install custom extractors across organizations. Includes typed SDK, manifest validation, CLI lint/test, auto-versioning, rollback, and BYO container support.
Connect external sources to Mixpeek with zero code. Iconik adapter for video DAMs, push-based webhook ingestion for any source, inbound email connector, and Supabase Storage. Org-level connections configure once and reuse across buckets.
New POST /v1/features/search endpoint for direct vector similarity search across extracted features, without needing a retriever.
Batch jobs now survive cluster deployments. If a worker dies mid-run, the job is automatically resubmitted from where it left off. Combined with zombie batch TTL that auto-cancels stale drafts and fails stuck jobs.
Batch jobs now show estimated time to completion based on historical job durations. Visible in the API response and Studio batch details.
Primary-replica architecture with WAL-based replication, automatic failover with split-brain prevention, per-namespace recovery gates, atomic snapshots, and multi-namespace pods with lazy-load and LRU eviction.
MVS serverless namespaces now boot in under 1 second even at 833K points, down from 30-60 seconds. Achieved by skipping monolithic shard.bin in serverless mode and lazy-loading partitions.
Vector store replicas now autoscale based on query load using Kubernetes HPA. Client-side gRPC load balancing distributes queries across all healthy replicas.
New retrieval strategy combines HNSW centroid routing with RaBitQ quantization for faster approximate nearest neighbor search with lower memory footprint.
New Annotations API lets you attach human feedback to documents and features. Supports bulk create, update, and delete — useful for fine-tuning retrieval relevance and training custom models.
Query results can now expand to include parent and child documents using $expand keywords. Filter aliases make complex lineage queries simple. New /ancestors and /descendants endpoints.
Retrievers now cache results at both the retriever and rerank stage level. Repeated queries return in single-digit milliseconds instead of running the full pipeline.
Bring your own inference to retriever pipelines with feature_uri on rerank and LLM stages, plus a new classify stage. Run your own models inline during retrieval.
New moment_group REDUCE stage groups search results by video moment, returning the most relevant segment of each video instead of individual frames.