The process of configuring and maintaining search indices to achieve optimal trade-offs between query speed, accuracy, memory usage, and update throughput. Index optimization is critical for multimodal retrieval systems operating at scale.
Index optimization involves selecting the right index type, tuning its parameters, and maintaining it over time. For vector indices, this includes choosing between HNSW, IVF, or quantized variants, then tuning parameters like M (connections), ef (search width), and nprobe (clusters to search). For keyword indices, optimization covers analyzer configuration, field mapping, and segment merge policies.
Vector index optimization targets three dimensions: recall (accuracy), latency (speed), and memory (footprint). HNSW offers high recall and speed but high memory. IVF-PQ offers low memory but lower recall. Scalar quantization (SQ8) provides a middle ground. Keyword index optimization includes adjusting refresh intervals, merge policies, and replica counts. Benchmarking frameworks (ANN-benchmarks) help compare configurations systematically.
Connect a bucket and Mixpeek runs the whole multimodal search pipeline for you: extraction, indexing, and search over your own objects. No models to wire up, nothing to host.
Start with ManagedKeep your embeddings on your own cloud and run dense, sparse, and BM25 search directly on object storage. First 1M vectors free.
Start with MVS