A composable pipeline architecture for search that chains discrete stages: filter candidates, sort by relevance, reduce duplicates, enrich with context, and apply business logic. Unlike single-query search, multi-stage retrieval lets you express complex information needs as a sequence of operations, similar to how SQL chains WHERE, ORDER BY, GROUP BY, and JOIN.
Each stage in the pipeline receives a result set from the prior stage and applies a transformation: a feature search stage performs vector similarity lookup, a filter stage applies metadata predicates, a rerank stage re-scores results using a cross-encoder, and an enrichment stage appends additional fields. Stages are configured declaratively and executed in sequence.
Connect a bucket and Mixpeek runs the whole multimodal search pipeline for you: extraction, indexing, and search over your own objects. No models to wire up, nothing to host.
Start with ManagedKeep your embeddings on your own cloud and run dense, sparse, and BM25 search directly on object storage. First 1M vectors free.
Start with MVS