A system for searching across different data types using one or more modalities as queries (e.g., using text to find images or vice versa).
Multimodal retrieval systems enable cross-modal search by mapping different data types into a shared semantic space. This allows users to search using one modality (e.g., text) to find relevant content in another modality (e.g., images).
Uses neural networks and embedding models to create unified representations. Implements efficient indexing and retrieval mechanisms, often combining multiple ranking strategies and reranking approaches.
Connect a bucket and Mixpeek runs the whole multimodal search pipeline for you: extraction, indexing, and search over your own objects. No models to wire up, nothing to host.
Start with ManagedKeep your embeddings on your own cloud and run dense, sparse, and BM25 search directly on object storage. First 1M vectors free.
Start with MVS