Quickstart
Create a namespace
No schema needed: MVS infers vector dimensions on first write. You can optionally pre-declare vector configs if you want to set a specific distance metric.
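
As a rough illustration of schema-on-write, the toy in-memory model below fixes a vector's dimension on the first upsert and lets a config be pre-declared to pin the metric. It is not the MVS client: the class and method names, the metric spellings, the `cosine` default, and the rejection of mismatched dimensions are all assumptions made for the sketch.

```python
from dataclasses import dataclass, field

# Metrics named in the feature table; the exact string spellings are assumptions.
METRICS = {"cosine", "dot", "euclidean"}


@dataclass
class VectorConfig:
    dimension: int
    metric: str = "cosine"  # assumed default, not confirmed by the docs


@dataclass
class ToyNamespace:
    """In-memory stand-in for a namespace; hypothetical, not the MVS client."""

    configs: dict = field(default_factory=dict)

    def declare(self, name: str, dimension: int, metric: str) -> None:
        # Optional pre-declaration: pins the distance metric before any write.
        if metric not in METRICS:
            raise ValueError(f"unknown metric: {metric}")
        self.configs[name] = VectorConfig(dimension, metric)

    def upsert(self, name: str, vector: list) -> VectorConfig:
        cfg = self.configs.get(name)
        if cfg is None:
            # Schema-on-write: the first vector written fixes the dimension.
            cfg = VectorConfig(dimension=len(vector))
            self.configs[name] = cfg
        elif cfg.dimension != len(vector):
            # Assumed behavior: later writes with another dimension are rejected.
            raise ValueError(
                f"{name}: expected {cfg.dimension} dims, got {len(vector)}"
            )
        return cfg
```

Writing to an undeclared vector name infers its config, while a pre-declared name keeps the metric chosen up front.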
Create a retriever (one-time)
Querying is unified on retrievers: there is one query concept whether you bring your own vectors or promote to managed embedding. Create a retriever once; for a standalone namespace it takes the query vector you computed (input_mode: vector). All requests are scoped to the namespace via the X-Namespace header.
Architecture
Standalone vs Managed
Every namespace runs in one of two modes. Start standalone and promote when you’re ready, with no reindexing.

| | Standalone | Managed |
|---|---|---|
| Query latency | Lower — no embedding at query time | +50-200ms for auto-embedding |
| Embedding cost | You pay your provider directly | Included in platform pricing |
| Model flexibility | Any model, any fine-tune | Bound to registered inference services |
| Write path | Direct upsert only | Collections auto-process + direct upsert |
| Search input | Pre-computed vectors, text (BM25), sparse | Also accepts raw text/URLs (auto-embedded) |
| Best for | Existing ML infra, low-latency, custom models | End-to-end processing, file pipelines |
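
To make the "Search input" row concrete, the sketch below builds (but does not send) a retriever query for either mode. Only the X-Namespace header comes from this page; the endpoint path, the `vector` and `text` body fields, and the function name are hypothetical placeholders, so check the API reference before relying on them.

```python
def build_query_request(namespace, retriever_id, *, vector=None, text=None):
    """Describe a retriever query as a plain dict; nothing is sent."""
    if (vector is None) == (text is None):
        raise ValueError("pass exactly one of vector= or text=")
    if vector is not None:
        # Standalone (and managed): the caller supplies a pre-computed vector.
        body = {"vector": list(vector)}  # hypothetical field name
    else:
        # Managed only: raw text would be embedded by the platform.
        body = {"text": text}  # hypothetical field name
    return {
        "method": "POST",
        # Hypothetical path; the real route is not given on this page.
        "path": f"/v1/retrievers/{retriever_id}/execute",
        # Documented: every request is scoped via the X-Namespace header.
        "headers": {"X-Namespace": namespace},
        "json": body,
    }
```

The same retriever ID appears in both cases, which mirrors the claim that retrievers keep working after promotion; only the input changes.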
Features
All features work identically in standalone and managed modes.

| Capability | Description |
|---|---|
| Dense search | Vector similarity with cosine, dot product, or euclidean distance |
| Sparse search | Sparse vector queries for learned sparse representations (SPLADE, etc.) |
| BM25 keyword search | Full-text search on payload fields via text indexes |
| Hybrid search | Combine dense + BM25 + sparse in one query with RRF or DBSF fusion |
| Metadata filtering | Filter on any payload field — combine with any search type |
| Payload indexes | Manual or adaptive — auto-created based on query patterns |
| Schema-on-write | Auto-create vector indexes on first upsert — no upfront declaration needed |
| Usage metrics | Per-namespace breakdowns of vectors, storage tiers, queries, and writes |
| Storage tiering | Hot, cold, archive tiers — see storage tiering |
| Namespace cloning | Clone namespaces for testing or environment branching |
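
For readers unfamiliar with RRF, the hybrid-search row can be illustrated with a minimal Reciprocal Rank Fusion sketch that merges ranked ID lists from dense, BM25, and sparse searches. This is the textbook formulation, not MVS's implementation; the constant `k=60` is the commonly used value, and whether MVS uses it is an assumption.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Fuse ranked ID lists with Reciprocal Rank Fusion.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Illustrative result lists, one per search type.
dense = ["a", "b", "c"]
bm25 = ["b", "d", "a"]
sparse = ["b", "a", "e"]
fused = rrf_fuse([dense, bm25, sparse])
```

Because RRF works on ranks rather than raw scores, it needs no score normalization across search types, which is what makes it a convenient default for mixing dense, keyword, and sparse results.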
Billing
MVS pricing is purely usage-based: no per-vector caps, no namespace limits. Tiers gate support level, not features.

| Resource | Price |
|---|---|
| Storage | $0.023 / GB / month |
| Hot cache | $25 / GB / month |
| Queries | $1 / 1M queries |
| Writes | $1 / 1M writes |

| Tier | Minimum | Support |
|---|---|---|
| Starter | $0/mo | Community |
| Growth | $50/mo (usage applies toward minimum) | Email + SLA |
| Enterprise | Custom | Dedicated + SSO + HIPAA |
Check your usage through the API (GET /v1/organizations/billing/usage/vector-backend) or view it in the Studio dashboard under Billing.
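
The price table can be turned into a quick monthly estimate. The sketch below uses the listed prices and assumes that "usage applies toward minimum" means the bill is the larger of metered usage and the tier minimum; the function and key names are illustrative, and Enterprise is omitted because its minimum is custom.

```python
# Prices copied from the table above, in US dollars.
PRICES = {
    "storage_gb": 0.023,      # per GB per month
    "hot_cache_gb": 25.0,     # per GB per month
    "queries_millions": 1.0,  # per 1M queries
    "writes_millions": 1.0,   # per 1M writes
}
TIER_MINIMUM = {"starter": 0.0, "growth": 50.0}


def estimate_monthly_bill(tier, **usage):
    """Return (usage_cost, billed) in dollars for one month."""
    unknown = set(usage) - set(PRICES)
    if unknown:
        raise ValueError(f"unknown usage keys: {sorted(unknown)}")
    usage_cost = sum(PRICES[key] * amount for key, amount in usage.items())
    # Assumed reading of the minimum: pay whichever is larger.
    billed = max(usage_cost, TIER_MINIMUM[tier])
    return round(usage_cost, 2), round(billed, 2)


heavy = estimate_monthly_bill(
    "growth", storage_gb=100, hot_cache_gb=2, queries_millions=5, writes_millions=10
)
light = estimate_monthly_bill(
    "growth", storage_gb=10, queries_millions=1, writes_millions=1
)
```

Under these assumptions the first workload is billed on usage alone, while the second falls below the Growth minimum and is billed at $50. The hot cache dominates the first estimate, so cache sizing is the main lever on cost.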
Next Steps
Namespaces: vector indexes, metrics, BM25
Documents & Search: upsert, query, manage
Promote: standalone → managed
Ready to go beyond BYO vectors? Promote your standalone namespace to managed mode to unlock automatic embedding, file processing pipelines, and enrichment without reindexing. Your retrievers keep working unchanged: after promotion, the same retriever can auto-embed raw text instead of taking a pre-computed vector. See the migration guide for details.

