Deep research workflows orchestrate multiple retriever executions, enrichment passes, and synthesis steps to answer complex questions. Mixpeek's stage catalog (search, filter, enrich, transform, compose) gives you the primitives to build these flows without bespoke infrastructure.

## Documentation Index
Fetch the complete documentation index at: https://docs.mixpeek.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
## Building Blocks
| Stage Type | Examples | Use in Research |
|---|---|---|
| Search | semantic_search, hybrid_search, late_interaction_search, web_search | Gather candidate documents across modalities and the open web |
| Filter | filter (structured/text/LLM/custom) | Narrow to relevant time ranges, entities, or sentiment |
| Enrich | join (direct/retriever), taxonomy | Attach structured context, e.g., taxonomy tags or related entities |
| Transform | llm_generation | Summarize, extract key facts, or generate structured notes |
| Compose | retriever, external_api_call | Chain sub-retrievers or call external services (e.g., fact-check APIs) |
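In practice, a retriever is an ordered list of stage configurations drawn from this catalog. The sketch below shows the general shape as Python data; the field names (`stage_name`, `version`, `parameters`) are illustrative assumptions rather than the exact schema:

```python
# Illustrative only: a retriever as an ordered list of stages.
# Field names (stage_name, version, parameters) are assumptions,
# not the exact schema; see the Retrievers docs for the real shape.
research_retriever = {
    "retriever_name": "deep_research_example",
    "stages": [
        # Search: gather candidate documents across collections
        {"stage_name": "hybrid_search", "version": "v1",
         "parameters": {"limit": 50}},
        # Filter: narrow to the relevant slice
        {"stage_name": "filter", "version": "v1",
         "parameters": {"filters": {"published_at": {"gte": "2024-01-01"}}}},
        # Enrich: attach taxonomy tags
        {"stage_name": "taxonomy", "version": "v1",
         "parameters": {"taxonomy_id": "tax_research_areas"}},
        # Transform: synthesize structured notes
        {"stage_name": "llm_generation", "version": "v1",
         "parameters": {"prompt": "Summarize the candidates with citations."}},
    ],
}
```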
## Common Patterns
### Literature Review
- Seed search using `hybrid_search` to retrieve recent papers.
- Structured filter by publication date and venue.
- Taxonomy join to classify by research area.
- LLM generation stage to summarize findings with citations.
- Store summaries alongside `feature_id` references for auditability (see the execution sketch below).
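A sketch of running such a retriever and storing auditable notes, assuming a hypothetical base URL, auth header, and response fields (`generated_text`, `results`, `execution_id`); the real endpoint paths and shapes live in the API reference:

```python
import requests

API = "https://api.mixpeek.com/v1"               # assumed base URL for illustration
HEADERS = {"Authorization": "Bearer <API_KEY>"}  # assumed auth scheme

# Execute a previously defined literature-review retriever.
resp = requests.post(
    f"{API}/retrievers/literature_review/execute",
    json={"query": {"text": "diffusion models for molecule generation"}},
    headers=HEADERS,
)
resp.raise_for_status()
result = resp.json()

# Store the summary alongside its source feature_ids for auditability.
note = {
    "summary": result.get("generated_text"),
    "sources": [doc.get("feature_id") for doc in result.get("results", [])],
    "execution_id": result.get("execution_id"),
}
```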
### Competitive Intelligence
- Use `web_search` + `web_lookup` stages to pull public announcements.
- Join with internal product docs via `join@v1` (retriever strategy) to compare specs, as sketched below.
- Apply a custom filter to spotlight price or feature gaps.
- Generate a briefing memo with the `llm_generation` stage.
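The join step might look like the sketch below; the strategy value matches this guide, but `map_query_from` and `output_field` are hypothetical parameter names:

```python
# Sketch of a join@v1 stage with the retriever strategy: for each incoming
# web result, query an internal sub-retriever and attach what it finds.
join_stage = {
    "stage_name": "join", "version": "v1",
    "parameters": {
        "strategy": "retriever",
        "retriever_name": "internal_product_docs",  # hypothetical sub-retriever
        "map_query_from": "title",        # assumed: doc field used as the sub-query
        "output_field": "matched_specs",  # assumed: where joined docs are attached
    },
}
```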
### Incident Investigation
- Collect relevant runbooks/logs via `semantic_search` over internal collections.
- Use `filter` stages to isolate the incident window (see the sketch below).
- Enrich with taxonomy-based tags (`taxonomy@v1`) for impacted systems.
- Summarize timeline and root cause via `llm_generation`, keeping citations.
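The window filter might be expressed like this, assuming `gte`/`lte` comparison operators in the structured-filter syntax:

```python
# Sketch of a filter stage isolating the incident window.
# Operator names (gte/lte) are assumptions about the filter syntax.
incident_window_filter = {
    "stage_name": "filter", "version": "v1",
    "parameters": {
        "filters": {
            "timestamp": {
                "gte": "2024-06-01T14:00:00Z",  # incident start
                "lte": "2024-06-01T18:30:00Z",  # incident end
            }
        }
    },
}
```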
## Orchestrating Multi-Retriever Flows
Leverage the `retriever@v1` compose stage to call sub-retrievers based on previous stage output.
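As a sketch, such a stage might conditionally fan out to a broader sub-retriever when earlier stages return too few hits. The condition syntax, parameter names, and sub-retriever name below are illustrative assumptions, not the exact schema:

```python
# Sketch of a retriever@v1 compose stage. The condition and merge fields
# are assumptions about how conditional routing is expressed.
compose_stage = {
    "stage_name": "retriever", "version": "v1",
    "parameters": {
        "retriever_name": "web_fallback_search",    # hypothetical sub-retriever
        "condition": {"results_count": {"lt": 5}},  # run only if prior stages found < 5 docs
        "merge_strategy": "append",                 # assumed: append sub-results to the flow
    },
}
```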
## Capturing Feedback
- Record user signals with the Interactions API (`click`, `long_view`, `positive_feedback`, etc.), as sketched below.
- Feed interactions back into rerankers or filter stages (“hide documents seen in this session”).
- Combine interactions with `analytics` endpoints to optimize parameter choices (e.g., increase `hybrid_search.limit` if users often tap beyond the top 10).
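A minimal sketch of recording a signal, assuming a REST endpoint at `/interactions` and these payload fields; consult the Interactions API reference for the actual contract:

```python
import requests

API = "https://api.mixpeek.com/v1"               # assumed base URL for illustration
HEADERS = {"Authorization": "Bearer <API_KEY>"}  # assumed auth scheme

# Record that an analyst clicked a result. The payload fields below
# (interaction_type, feature_id, session_id) are assumptions.
requests.post(
    f"{API}/interactions",
    json={
        "interaction_type": "click",
        "feature_id": "feat_123",   # the result that was opened
        "session_id": "sess_abc",   # lets later stages hide already-seen docs
    },
    headers=HEADERS,
).raise_for_status()
```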
## Operational Tips
- **Persist execution IDs** – each `execute` response includes `execution_id`; link it to your research session for audit trails.
- **Monitor stage telemetry** – `stage_statistics` identifies bottlenecks (e.g., LLM stages dominating latency).
- **Budget controls** – set `budget_limits` on retrievers to cap time or credit consumption for exploratory workflows.
- **Cache intermediate results** – use `cache_stage_names` for expensive discovery steps, especially when analysts re-run queries. (A combined sketch follows this list.)
- **Leverage tasks** – schedule enrichment batches (clusters, taxonomies) ahead of time so research pipelines stay low-latency.
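The budget and caching tips combine naturally on exploratory retrievers. In the sketch below, `budget_limits` and `cache_stage_names` come from this guide, but their exact placement and inner fields are assumptions:

```python
# Sketch: cap spend and cache expensive stages on an exploratory retriever.
# The inner budget fields (max_duration_seconds, max_credits) are assumptions.
exploratory_retriever = {
    "retriever_name": "exploratory_research",
    "budget_limits": {"max_duration_seconds": 60, "max_credits": 10},
    "cache_stage_names": ["hybrid_search", "taxonomy"],  # cache discovery steps analysts re-run
    "stages": [],  # stage list as in the earlier sketches
}
```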
## Suggested Architecture
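At a high level, the patterns above share one flow: broad search feeds progressively narrower filtering and enrichment, a transform stage synthesizes the answer, and feedback loops back into future runs. A rough sketch:

```text
query
  -> search (hybrid_search / semantic_search / web_search)
  -> filter (time window, entities, sentiment)
  -> enrich (join, taxonomy)
  -> transform (llm_generation: summaries, notes)
  -> compose (sub-retrievers, external_api_call)
  -> results + execution_id
       -> interactions / analytics -> tune rerankers, filters, limits
```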
## Next Steps
- Review Retrievers for stage configuration details.
- Learn how Filters and Taxonomies contribute structure to exploratory pipelines.
- Use Operations → Observability to monitor research workloads in production.

