Multimodal Hybrid Search Pipeline
Combine vector search with keyword search (BM25) across text, images, and video for the most comprehensive multimodal retrieval system.
from mixpeek import Mixpeekclient = Mixpeek(api_key="YOUR_API_KEY")# Create a multi-extractor collectionnamespace = client.namespaces.create(name="multimodal-search")# Configure hybrid retrieverretriever = client.retrievers.create(namespace_id=namespace.id,name="hybrid-multimodal",stages=[{"type": "hybrid_search","vector_weight": 0.7,"bm25_weight": 0.3,"top_k": 100},{"type": "rerank","model": "colbert-v2","top_k": 20}])# Query across all modalitiesresults = client.retrievers.execute(retriever_id=retriever.id,query="quarterly revenue growth chart",filters={"modality": ["text", "image"]})
Feature Extractors
Retriever Stages
rerank
Rerank documents using cross-encoder models for accurate relevance
