Browse the extractor catalog on GitHub
Runnable reference for every built-in Mixpeek extractor — inputs, parameters, output fields, embedding models, and copy-paste examples. Auto-generated from the live registry, so it always matches production.
How It Works
Instead of a self-service marketplace, community extractors follow a submission and review process:
- Develop: Build your extractor following the Extractor Developer Guide
- Submit: Upload your extractor archive via `POST /v1/extractors/submissions`
- Review: The Mixpeek team reviews the submission for quality, security, and compatibility
- Merge: Approved extractors are merged into `engine/extractors/` and become available as built-in extractors
Submitting an Extractor
Package your extractor as a zip archive and submit it for review.
Submission Requirements
- The extractor must pass validation and the security scan (same rules as custom extractor uploads)
- Include a complete `manifest.py` with correct `features` definitions
- Include a working `pipeline.py` and, optionally, `realtime.py`
- The archive must be under 500 MB
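The packaging step and a local pre-check against the requirements above can be sketched as follows. This is a minimal illustration, not Mixpeek's validator: the helper names `build_archive` and `precheck_archive` are hypothetical, the sketch assumes `manifest.py` and `pipeline.py` sit at the root of the archive, and it reads "500 MB" as 500 × 1024 × 1024 bytes. The server-side validation and security scan may check considerably more.

```python
import zipfile
from pathlib import Path

# Assumption: "under 500 MB" is interpreted as fewer than 500 * 1024 * 1024 bytes.
MAX_ARCHIVE_BYTES = 500 * 1024 * 1024
# Required files named in the submission requirements; realtime.py is optional.
REQUIRED_FILES = ("manifest.py", "pipeline.py")


def build_archive(extractor_dir: Path, archive_path: Path) -> Path:
    """Zip every file under extractor_dir, storing paths relative to that directory."""
    with zipfile.ZipFile(archive_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(extractor_dir.rglob("*")):
            # Skip directories, and skip the archive itself if it lives inside the tree.
            if path.is_file() and path.resolve() != archive_path.resolve():
                zf.write(path, arcname=path.relative_to(extractor_dir).as_posix())
    return archive_path


def precheck_archive(archive_path: Path) -> list[str]:
    """Return locally detectable problems; an empty list means none were found."""
    problems = []
    if archive_path.stat().st_size >= MAX_ARCHIVE_BYTES:
        problems.append("archive is not under 500 MB")
    with zipfile.ZipFile(archive_path) as zf:
        names = set(zf.namelist())
    # Assumption: required files are expected at the archive root.
    for required in REQUIRED_FILES:
        if required not in names:
            problems.append(f"missing {required}")
    return problems
```

Calling `precheck_archive(build_archive(Path("my_extractor"), Path("my_extractor.zip")))` returns an empty list when both required files are present and the archive is small enough. The archive would then be uploaded via `POST /v1/extractors/submissions`; the request format is not described in this section, so it is left out of the sketch.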
Review Process
After submission:
- The archive is validated and scanned automatically
- The Mixpeek team reviews the extractor code for quality and security
- If approved, the extractor code is merged into `engine/extractors/`
- The submitting organization is notified and the extractor becomes available to all users
Trust Tiers
| Tier | Description |
|---|---|
| `community` | Extractors submitted by community members and approved by Mixpeek |
| `verified` | Extractors with additional performance and reliability validation |
| `official` | Extractors developed and maintained by Mixpeek |
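Client code that wants to restrict itself to a minimum trust tier could model the table above as an enum. The sketch below is illustrative only: the ordering `community` < `verified` < `official` is an assumption (the table does not state a ranking), and the `trust_tier` key and both function names are hypothetical rather than part of the Mixpeek API.

```python
from enum import Enum


class TrustTier(str, Enum):
    """Tier names taken from the table above."""
    COMMUNITY = "community"
    VERIFIED = "verified"
    OFFICIAL = "official"


# Assumption: tiers are ranked in the order they are listed; the docs do not say so.
_TIER_RANK = {
    TrustTier.COMMUNITY: 0,
    TrustTier.VERIFIED: 1,
    TrustTier.OFFICIAL: 2,
}


def meets_minimum(tier: str, minimum: str) -> bool:
    """True if `tier` ranks at or above `minimum`; unknown names raise ValueError."""
    return _TIER_RANK[TrustTier(tier)] >= _TIER_RANK[TrustTier(minimum)]


def filter_by_tier(extractors: list[dict], minimum: str) -> list[dict]:
    """Keep extractor records whose hypothetical 'trust_tier' key meets the minimum."""
    return [e for e in extractors if meets_minimum(e["trust_tier"], minimum)]
```

For example, `filter_by_tier(records, "verified")` would keep records tagged `verified` or `official` and drop those tagged `community`.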

