Documentation Index
Fetch the complete documentation index at: https://docs.mixpeek.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
Three Primitives
| Primitive | What It Is | Role |
|---|---|---|
| Object | Raw file or record in a bucket (video, PDF, JSON row, image). | The input boundary. You upload objects. |
| Document | One row of output in a collection, produced by decomposition. | The query boundary. You search documents. |
| Feature | A named output attached to a document (embedding, transcript, OCR text, label, score). | The composition boundary. Retrievers reference features by URI. |
What Decomposition Decides
The feature extractor controls how an object is decomposed into documents. The strategy depends on the content type:| Content Type | Decomposition Strategy | Result |
|---|---|---|
| Video | Time intervals, scene boundaries, or silence gaps | Each segment = 1 document with visual embedding + transcript + scene description |
| Audio | Silence boundaries or fixed intervals | Each segment = 1 document with transcript + transcript embedding |
| PDF / Document | Page, paragraph, or sentence boundaries | Each chunk = 1 document with text content + text embedding |
| Image | No split (1:1) | 1 image = 1 document with visual embedding + OCR + description |
| Structured data | Row-level (1:1) | 1 row = 1 document with field-level features |
Why It Matters
Without decomposition, a 30-minute video is one record. Searching for “the moment the CEO mentions revenue” means scanning the entire video. There’s no way to return a specific timestamp. With decomposition, that video becomes ~180 ten-second segments, each with its own transcript embedding, visual embedding, and scene description. A search returns the exact segment at 14:30 where the CEO says “revenue grew 22%.” The same applies to documents: a 200-page PDF becomes 200 searchable chunks instead of one monolithic record.Feature URIs
Every feature produced by decomposition gets a URI that uniquely identifies it:multimodal_embedding in one stage and face_embedding in another, even though they were produced by different extractors.
Configuring Decomposition
Decomposition is configured via thefeature_extractor field on a collection:
settings object controls the decomposition strategy. Each extractor has its own settings — see the extractor-specific pages for details:
From Video
Time, scene, and silence segmentation strategies
From Images
Visual embeddings, OCR, and structured extraction
From Audio
Silence-boundary segmentation and transcription
From Documents
Page, paragraph, and sentence chunking

