
Retrieval for agents, on your object storage.
Bring your own vectors with MVS, or let Managed extract and index your files. One retrieval API, on the object storage you already use.
Bring vectors
Agent-native vector store on object storage. Dense, sparse, and BM25 search. First 1M vectors free forever.
Connect files
Managed indexing extracts scenes, faces, OCR, transcripts, and embeddings from video, images, audio, PDFs, and docs.
See retrieval in action.
Search inside a video by what's on screen, said, or written, using the same hybrid retrieval your agents call through the API.
One install. Two paths.
Most retrieval stacks mean gluing together a vector DB, a file pipeline, and an agent layer. Mixpeek is one install with two ways in.
Bring embeddings
Retrieval that lives where your data does.
Vectors and extracted features persist on the object storage you already own, so there's no in-memory index to keep hot and no extra copies to manage.
Object storage first
S3, GCS, B2, Azure, R2, MinIO, and S3-compatible stores stay the system of record, so no data leaves your cloud.
Agent-ready retrieval
Tools get searchable context with metadata, filters, traces, and deterministic retrieval plans.
Production controls
Usage limits, audit trails, namespaces, and self-hosted deployment for real workloads.
Query Mixpeek from wherever your agent runs, with the same retrieval API everywhere.
From object to retrieval.
Watch a file get decomposed into features, indexed, and made searchable across talent, IP, taste, and compliance workflows.
Connect any object store. Every file becomes a hierarchy of typed, versioned features.
Multi-stage pipelines: search, filter, join, rerank. Deterministic, auditable traces.
Plugs into your existing stack.
Connect your storage, point Mixpeek at it, and every file becomes searchable by what's inside it. No migration, no code changes.

Mux
Every Mux upload becomes searchable by face, scene, transcript, and on-screen text, with no manual tagging.
View integration →
Backblaze B2
S3-compatible extraction at 1/5th the cost. Store on B2, extract with Mixpeek, zero egress fees.
View integration →Iconik
Every asset in your DAM becomes findable by what's inside it: scenes, faces, spoken words, on-screen text.
View integration →In production right now.
Visual search across 45k artworks
Upload any image and find visually similar paintings across 45,000+ artworks, or just describe what you're looking for. Hybrid image and text retrieval, ranked with RRF.
Try gallery search →Posters that learn your taste
Like or dislike movie posters and watch the grid adapt to your taste in real time. Interaction signals feed learned fusion so recommendations improve from usage.
Try movie personalization →Face search across video
Drop in a headshot and find every clip a person appears in across 63 video ads and 2,600+ faces. Full trace for takedown evidence.
Try face search →Free vectors. Usage-based indexing.
MVS starts with free vectors. Managed starts with credits for object extraction and indexing.
Bring your own embeddings. Store and search vectors on object storage with no expiration on the free tier.
Start with MVSCredits cover extraction, embedding, indexing, enrichment, and retriever execution for raw objects.
Start with ManagedDedicated infrastructure, self-hosted options, SSO, SLA, security reviews, and hands-on architecture support.
Talk to usCommon questions.
Do I have to move my data?
No. Mixpeek reads from your existing S3, GCS, R2, Azure, or S3-compatible bucket. Your storage stays the system of record, and nothing leaves your cloud.
How fast is retrieval?
Hybrid queries (dense, sparse, and BM25) return in well under 100ms p95, even with vectors persisted on object storage rather than held in RAM.
Do I need embeddings to start?
No. Bring your own vectors with MVS, or point Managed at raw files and it generates embeddings and features for you.
What can Managed extract?
Faces, scenes, transcripts, OCR, labels, and embeddings from video, images, audio, PDFs, and documents, all indexed at the object level.
Can I self-host?
Yes. Deploy in your own cloud (BYO-Cloud) with SOC 2 and HIPAA-ready controls, SSO, audit trails, and namespaces.
How does pricing work?
MVS starts free with 1M vectors and no expiration. Managed is usage-based credits covering extraction, embedding, indexing, and retriever execution.