Warehouse pricing, not database pricing. Pay for extraction and storage, not per query.
Choose the plan that fits your needs. Scale as you grow with our flexible pricing options.
Get started with basic features for personal or small projects.
Pay only for what you use. No base fee, no commitments.
Custom solutions for large-scale enterprise needs with volume discounts.
Vector databases charge per vector stored and per query. That model breaks down when 80% of your data is queried less than once a week. A warehouse separates storage from compute so you only pay for what you actually use.
Hot data in Qdrant for sub-10ms queries. Warm data in S3 Vectors at 90% lower cost. Cold data archived as metadata only.
Data automatically migrates between tiers based on access patterns.
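As a rough illustration of access-based tiering, the sketch below assigns a tier from the time since a record was last queried. The thresholds (7 and 90 days) and the names `Tier` and `choose_tier` are hypothetical, chosen only for illustration. They are not Mixpeek's actual migration policy, which is not documented here.

```python
from datetime import datetime, timedelta
from enum import Enum


class Tier(Enum):
    HOT = "hot"    # e.g. Qdrant
    WARM = "warm"  # e.g. S3 Vectors
    COLD = "cold"  # metadata only


# Hypothetical thresholds, for illustration only.
WARM_AFTER = timedelta(days=7)
COLD_AFTER = timedelta(days=90)


def choose_tier(last_accessed: datetime, now: datetime) -> Tier:
    """Pick a storage tier from how long the data has been idle."""
    idle = now - last_accessed
    if idle < WARM_AFTER:
        return Tier.HOT
    if idle < COLD_AFTER:
        return Tier.WARM
    return Tier.COLD


now = datetime(2026, 3, 1)
print(choose_tier(now - timedelta(days=2), now))    # Tier.HOT
print(choose_tier(now - timedelta(days=30), now))   # Tier.WARM
print(choose_tier(now - timedelta(days=200), now))  # Tier.COLD
```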
Searches and retrievals are free. You pay for extraction (ingestion) and storage. No per-query fees on your own data.
By comparison, Pinecone charges per query and per vector stored in hot memory.
Extraction runs on-demand via serverless Ray clusters. No GPU instances running while you sleep.
Enterprise customers can reserve capacity for predictable workloads.
Traditional vector database
~$4,200/mo
All vectors in hot memory, per-query fees, fixed pod costs
Mixpeek warehouse (80% warm tier)
~$680/mo
20M hot vectors + 80M warm (S3 Vectors) + free queries
Estimates based on typical pricing as of March 2026. Actual costs depend on vector dimensions, query patterns, and provider pricing. Contact us for a personalized cost analysis.
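To put the two estimates side by side, this short calculation derives the monthly savings and the warm-tier share from the figures quoted above. It uses only those quoted numbers. The variable names are illustrative.

```python
# Figures quoted in the comparison above (estimates, not a quote for your workload).
traditional_usd_per_month = 4200
warehouse_usd_per_month = 680
hot_vectors = 20_000_000
warm_vectors = 80_000_000

monthly_savings_usd = traditional_usd_per_month - warehouse_usd_per_month
savings_pct = monthly_savings_usd / traditional_usd_per_month * 100
warm_share_pct = warm_vectors / (hot_vectors + warm_vectors) * 100

print(f"Savings: ${monthly_savings_usd:,}/mo ({savings_pct:.0f}% lower)")
print(f"Warm tier share: {warm_share_pct:.0f}%")
```

With these inputs the savings come to $3,520 per month, roughly 84% lower than the traditional estimate.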
Each extractor is billed based on what it processes. Costs are measured in credits (1 credit = $0.001).
Extractors are grouped by complexity tier. Higher-tier extractors involve more compute-intensive ML models.
Video, image, and text embedding via Vertex AI (1408D unified space)
Face detection (SCRFD) and recognition (ArcFace 512D embeddings)
Playwright crawling with LLM-based content extraction
PDF layout understanding with optional VLM correction
Video decomposition into scenes, OCR, and transcription
Text embedding via E5 (1024D)
Image embedding via CLIP/SigLIP
Text sentiment classification
Storage only, no ML processing
1 credit = $0.001
Credits are deducted from your balance as extractors process data.
Pay per unit processed
Each extractor charges based on its input type: minutes of video, images, pages, tokens, etc.
Composable pricing
Chain multiple extractors in a pipeline. You only pay for the extractors you use.
Drag the sliders to estimate how many credits you'll need per month.
See what it takes to build multimodal processing infrastructure on your own.
| Component | Mixpeek | DIY on AWS |
|---|---|---|
| Video/Image Processing | Included | Lambda + MediaConvert + Rekognition |
| Embedding Generation | Included | SageMaker + Bedrock |
| Vector Search | Included | OpenSearch |
| Storage | $2/GB | S3 + data transfer |
| Pipeline Orchestration | Included | Step Functions + EventBridge |
| Time to Production | Minutes | Months of engineering |
| Ongoing Maintenance | Managed | Dedicated team required |
Each feature extractor charges credits based on what it processes: minutes of video, number of images, text tokens, document pages, and so on. 1 credit = $0.001. Credits are deducted from your account balance as extractors run. You can monitor usage in real time from the dashboard.
Extractors vary in computational complexity. Simple extractors like text embedding use lightweight models and cost as little as 1 credit per 1K tokens. Premium extractors like the multimodal extractor run GPU-intensive models for video segmentation, scene detection, and multimodal embedding, and cost 50 credits per minute of video.
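A minimal sketch of how the two example rates above translate into dollars. It uses only the figures stated on this page (1 credit = $0.001, 1 credit per 1K tokens, 50 credits per video minute). The function names are illustrative, and the sketch assumes no rounding or minimum charge, which this page does not specify.

```python
CREDITS_PER_USD = 1000  # 1 credit = $0.001

# Example rates quoted above; other extractors have their own rates.
TEXT_EMBEDDING_CREDITS_PER_1K_TOKENS = 1
MULTIMODAL_CREDITS_PER_VIDEO_MINUTE = 50


def text_embedding_credits(tokens: int) -> float:
    """Credits for text embedding at the lowest quoted rate."""
    return tokens / 1000 * TEXT_EMBEDDING_CREDITS_PER_1K_TOKENS


def multimodal_credits(video_minutes: float) -> float:
    """Credits for the multimodal extractor on video input."""
    return video_minutes * MULTIMODAL_CREDITS_PER_VIDEO_MINUTE


def credits_to_usd(credits: float) -> float:
    return credits / CREDITS_PER_USD


text = text_embedding_credits(2_000_000)  # 2M tokens
video = multimodal_credits(600)           # 10 hours of video
print(f"Text:  {text:,.0f} credits = ${credits_to_usd(text):,.2f}")
print(f"Video: {video:,.0f} credits = ${credits_to_usd(video):,.2f}")
```

Under these assumptions, 2M tokens cost 2,000 credits ($2.00) and 10 hours of video cost 30,000 credits ($30.00).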
Our usage-based pricing is pure pay-as-you-go with no base fee. You pay only for the credits your extractors consume, at $0.001 per credit, so your costs scale linearly with usage. Volume discounts are available at higher usage levels.
No, our usage-based plan is billed monthly with no long-term commitments. You can upgrade, downgrade, or cancel at any time.
There are no hard limits on the usage-based plan. You'll be billed for your actual usage. You can set spending caps and budgets in the dashboard to control costs.
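When picking a spending cap, it can help to work backwards from a dollar budget to a processing volume. The sketch below does that conversion using the stated credit price. The helper name is illustrative, the 50 credits per video minute figure is the multimodal rate quoted on this page, and how the dashboard enforces a cap is not modeled.

```python
CREDITS_PER_USD = 1000  # 1 credit = $0.001


def max_units_within_budget(budget_usd: int, credits_per_unit: int) -> int:
    """Whole units (minutes, pages, ...) that fit under a monthly budget."""
    budget_credits = budget_usd * CREDITS_PER_USD
    return budget_credits // credits_per_unit


# Multimodal extractor: 50 credits per minute of video.
minutes = max_units_within_budget(budget_usd=100, credits_per_unit=50)
print(f"A $100 cap covers about {minutes:,} minutes of video")
```

At that rate, a $100 monthly cap covers about 2,000 minutes of video, before any volume discount.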
Yes, you can compose multiple extractors in a single collection pipeline. Each extractor is billed independently at its own rate. For example, if you run the multimodal extractor and the face identity extractor on the same video, you pay for each separately.
We offer volume discounts based on credit usage: up to 10% off at 100K credits, up to 20% off at 500K, and up to 25% off at 1M+ credits. Contact sales for enterprise pricing.
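The tiers above can be read as a lookup from monthly credit usage to a maximum discount. The sketch below computes a best-case bill under two assumptions that this page does not confirm: that the tier's maximum rate applies, and that it applies to the whole month's usage rather than only to credits above the threshold. Actual discounts are "up to" these rates and may be lower.

```python
CREDITS_PER_USD = 1000  # 1 credit = $0.001

# (minimum monthly credits, maximum discount in percent), highest tier first.
DISCOUNT_TIERS = [
    (1_000_000, 25),
    (500_000, 20),
    (100_000, 10),
]


def max_discount_pct(monthly_credits: int) -> int:
    """Largest advertised discount for a given monthly usage."""
    for threshold, pct in DISCOUNT_TIERS:
        if monthly_credits >= threshold:
            return pct
    return 0


def best_case_cost_usd(monthly_credits: int) -> float:
    """List price minus the maximum discount (assumed flat across all usage)."""
    list_price = monthly_credits / CREDITS_PER_USD
    return list_price * (100 - max_discount_pct(monthly_credits)) / 100


for credits in (50_000, 100_000, 600_000, 2_000_000):
    print(f"{credits:>9,} credits -> up to {max_discount_pct(credits)}% off, "
          f"best case ${best_case_cost_usd(credits):,.2f}")
```

For example, 600,000 credits list at $600 and fall in the 20% tier, so the best case under these assumptions is $480.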