An evolution of retrieval-augmented generation where AI agents autonomously decide what to retrieve, how to query, and when to perform additional retrieval steps based on intermediate results.
Agentic RAG moves beyond single-shot retrieval by introducing an agent layer that reasons about retrieval strategy. Instead of executing one fixed query against a vector store, the agent analyzes the user question, decomposes it into sub-questions if needed, selects appropriate data sources and retrieval methods for each, evaluates intermediate results for completeness and relevance, and decides whether additional retrieval rounds are needed. This iterative, autonomous approach produces higher quality answers for complex questions that span multiple topics, modalities, or knowledge domains.
Agentic RAG architectures typically include a planning component (an LLM that decomposes queries and selects tools), a retrieval toolkit (multiple search endpoints, filters, and data sources the agent can invoke), and a synthesis component (an LLM that combines retrieved context into a final response). The agent uses tool-calling capabilities to execute retrieval actions and observe results before deciding next steps. Mixpeek's composable retriever pipelines and multi-stage search capabilities provide the retrieval toolkit that agentic systems need, supporting filtered searches, cross-modal queries, and reranking as distinct tools the agent can invoke.