How It Works
- Create a retriever that defines your search criteria (e.g., semantic similarity, attribute filters)
- Create an alert referencing that retriever, with notification channels configured
- Attach the alert to a collection via
alert_applicationswith input mappings - Ingest documents — alerts execute automatically during post-processing (Phase 3)
- Receive notifications when matches are found
Architecture
Alerts execute during the post-processing pipeline after document ingestion completes:| Phase | System | Purpose |
|---|---|---|
| 1 | Taxonomies | Vector-based classification |
| 2 | Clusters | Document grouping |
| 3 | Alerts | Retriever execution + notifications |
| 4 | Retriever Enrichments | Field write-back |
Parallel Execution
Within a single alert, document-level retriever calls execute in parallel as independent Ray tasks. If a batch ingests 100 documents, all 100 retriever calls fan out simultaneously rather than running sequentially. Results are aggregated after all calls complete, and a single notification is sent if any document produced matches. Multiple alerts on the same collection execute sequentially to avoid race conditions in notification delivery.Configuration
Create an Alert
Attach to a Collection
Attach alerts to collections viaalert_applications when creating or updating a collection:
Input Mappings
Input mappings connect document fields to retriever input parameters:| Source Type | Description | Example |
|---|---|---|
document_field | Extract value from the ingested document using a dot-notation path | {"source_type": "document_field", "path": "metadata.category"} |
constant | Pass a fixed value to the retriever | {"source_type": "constant", "value": "safety_check"} |
Execution Modes
| Mode | Behavior |
|---|---|
on_ingest | Execute automatically when documents are ingested (default) |
scheduled | Execute on a schedule (does not trigger on ingest) |
on_demand | Execute only when manually triggered |
Notification Channels
Webhook
Slack
Monitoring Executions
Track alert execution history to monitor performance and debug issues:Comparison with Other Enrichment Types
| Feature | Taxonomies | Clusters | Alerts | Retriever Enrichments |
|---|---|---|---|---|
| Purpose | Vector-based classification | Document grouping | Notifications on match | Arbitrary retriever pipelines |
| Output | Label + score fields | Cluster assignments | Webhook/Slack/email notifications | Configurable field write-back |
| Phase | 1 | 2 | 3 | 4 |
| Execution | Parallel per document | Batch | Parallel per document | Sequential per document |
| Use cases | Face matching, entity linking | Segmentation, pattern discovery | Content monitoring, safety checks | LLM classification, cross-collection joins |

