Cross-MediaSimilar

RAG with MVS Standalone

Complete RAG pipeline using MVS for retrieval and OpenAI for generation. Chunk your documents, embed them with any provider, store in MVS, retrieve relevant context, and generate answers -- no managed feature extractors needed.

text

Multi-Tier

12.4K runs

Run in Builder

"What is the recommended database architecture for high availability?"

Why This Matters

Full control over your RAG pipeline without vendor lock-in. Choose your own chunking strategy, embedding model, and LLM while MVS handles the vector storage and retrieval at scale.

from openai import OpenAI
from mixpeek import Mixpeek

openai = OpenAI(api_key="your-openai-key")
mvs = Mixpeek(api_key="your-mvs-key")

NAMESPACE = "rag-docs"

def embed(text: str) -> list[float]:
    resp = openai.embeddings.create(model="text-embedding-3-small", input=text)
    return resp.data[0].embedding

# Step 1: Chunk documents
def chunk_text(text: str, chunk_size: int = 512, overlap: int = 64) -> list[str]:
    words = text.split()
    chunks = []
    for i in range(0, len(words), chunk_size - overlap):
        chunks.append(" ".join(words[i:i + chunk_size]))
    return chunks

# Step 2: Embed and upsert chunks into MVS
document = open("docs/architecture-guide.md").read()
chunks = chunk_text(document)

for i, chunk in enumerate(chunks):
    mvs.namespaces.documents.upsert(
        namespace=NAMESPACE,
        documents=[{
            "dense_embedding": embed(chunk),
            "content": chunk,
            "metadata": {
                "source": "architecture-guide.md",
                "chunk_index": i,
                "total_chunks": len(chunks)
            }
        }]
    )

# Step 3: Retrieve relevant chunks
query = "What is the recommended database architecture for high availability?"
results = mvs.namespaces.documents.search(
    namespace=NAMESPACE,
    query={"dense_embedding": embed(query)},
    top_k=5
)

# Step 4: Generate answer with LLM
context = "\n\n".join([
    f"[Chunk {doc['metadata']['chunk_index']}] {doc['content']}"
    for doc in results
])

response = openai.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": f"Answer based on this context. Cite chunk numbers.\n\n{context}"},
        {"role": "user", "content": query}
    ]
)

print(response.choices[0].message.content)

Feature Extractors

Retriever Stages

limit

Truncate results to a maximum count with optional offset for pagination

reduce

Documentation

MVS Overview Document Search Namespaces

Related Recipes & Resources

Explore these related resources to deepen your understanding and discover more powerful features

Recipe

Hybrid BM25 + Dense Vector Search

Use MVS hybrid search to combine BM25 keyword matching with dense vector similarity. Get the precision of exact keyword matches and the recall of semantic understanding in a single query.

Learn more

Recipe

Multimodal Search with MVS

Build multimodal search by embedding different content types (text, images, video frames) with your own models and searching across them in a single MVS namespace. Use CLIP or any multimodal embedding model for cross-modal retrieval.

Learn more

Recipe

Document Intelligence Search

Extract and search through PDFs, presentations, and documents. Combines OCR, layout analysis, and semantic search for comprehensive document retrieval.

Learn more

Recipe

BYO Embeddings Vector Search

Bring pre-computed embeddings from any provider (OpenAI, Cohere, Together, etc.) and upsert them directly into MVS for instant vector search. No feature extractors, no pipelines -- just embeddings in, results out.

Learn more

Extractor

Web Scraper

Extract structured data from webpages while maintaining semantic context and relationships

Learn more

Extractor

Text Embedding

Extract semantic embeddings from documents, transcripts and text content

Learn more