Get detailed information about a specific extractor.
Works for both builtin extractors and custom plugins.
Parameters:
extractor_id: Extractor identifier (e.g., ‘text_extractor_v1’, ‘my_custom_plugin_1_0_0’)Response includes:
Documentation Index
Fetch the complete documentation index at: https://docs.mixpeek.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
Bearer token authentication using your API key. Format: 'Bearer sk_xxxxxxxxxxxxx'. You can create API keys in the Mixpeek dashboard under Organization Settings.
"Bearer YOUR_MIXPEEK_API_KEY"
Namespace identifier for scoping this request. All resources (collections, buckets, taxonomies, etc.) are scoped to a namespace. You can provide either the namespace name or namespace ID. Format: ns_xxxxxxxxxxxxx (ID) or a custom name like 'my-namespace'. Falls back to ?namespace= query parameter if the header is omitted.
"ns_abc123def456"
"production"
"my-namespace"
Extractor details
Unified extractor response combining builtin and custom plugins.
This model provides a consistent view of all extractors available to a namespace, regardless of whether they are builtin or custom.
Name of the feature extractor
Version of the feature extractor
Unique identifier (name_version)
Origin of this extractor: 'builtin' (shipped with Mixpeek), 'custom' (user-uploaded plugin), or 'community' (marketplace)
builtin, custom, community Human-readable description
JSON schema for input data
JSON schema for output data
Lucide-react icon name for frontend rendering
JSON schema for parameters
What input types this extractor can handle: 'type_specific' (only one type, e.g. video-only) or 'multimodal' (handles multiple types with conditional processing). Type-specific extractors cannot use automatic-typed bucket properties.
For type-specific extractors: maps input keys to required types (e.g., {'video': 'video', 'thumbnail': 'image'}). For multimodal extractors: null.
Kind of real-time inference this extractor provides: 'embedding', 'rerank', 'classify', 'generate', or 'general'. Determines which retriever stages are compatible. Null if the extractor is batch-only.
Supported input types (video, image, text, etc.)
Maximum number of inputs per type
Default parameter values
Credit cost information (builtin extractors only)
Vector indexes this extractor produces
Payload indexes required by this extractor
Fields that identify unique positions within output documents. Used for deterministic document ID generation.
Primary feature URI (e.g., mixpeek://text_extractor@v1/embedding)
What this extractor can do: 'batch' (feature extraction during ingestion), 'realtime' (query-time inference for retriever stages)
Minimal working configuration for namespace + collection + input_mappings + parameters
Plugin ID (custom plugins only)
Whether the plugin is deployed (custom plugins only)
Validation status (custom plugins only)
passed, failed, pending Creation timestamp (custom plugins only)
Last update timestamp (custom plugins only)