What is MCP?
The Model Context Protocol is an open standard that lets AI assistants connect to external tools and data sources. Instead of copy-pasting API calls, you describe what you want and Claude handles the rest.
Choose Your Server
MCP clients have context limits. Instead of loading all 43 tools, pick the scoped server that matches your workflow:
Ingestion Server
18 tools: buckets, collections, and documents.
Best for data pipelines and content upload workflows.
https://mcp.mixpeek.com/ingestion/mcp
Retrieval Server
11 tools: retrievers, agents, and search.
Best for RAG applications, search UIs, and agent workflows.
https://mcp.mixpeek.com/retrieval/mcp
Admin Server
14 tools: namespaces, taxonomies, and clusters.
Best for platform administration and enrichment.
https://mcp.mixpeek.com/admin/mcp
Full Platform
43 tools: everything in one server.
Best for power users who need all capabilities.
https://mcp.mixpeek.com/mcp
Setup
Add the server to your Claude Desktop or Claude Code config. Replace YOUR_API_KEY with your key from the Mixpeek dashboard.
- Ingestion
- Retrieval
- Admin
- Full Platform
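As a rough illustration of what such a config entry might contain, the sketch below builds one entry per scope from the hosted URLs listed above and the Bearer header format used on this page. The key names (`mcpServers`, `type`, `url`, `headers`) and the `mixpeek-<scope>` entry name are assumptions about the client's config format, not something this page specifies, so check your client's documentation before copying the output.

```python
import json

# Hosted endpoints listed in "Choose Your Server".
SERVER_URLS = {
    "ingestion": "https://mcp.mixpeek.com/ingestion/mcp",
    "retrieval": "https://mcp.mixpeek.com/retrieval/mcp",
    "admin": "https://mcp.mixpeek.com/admin/mcp",
    "full": "https://mcp.mixpeek.com/mcp",
}


def build_server_entry(scope: str, api_key: str) -> dict:
    """Return one config entry for the given scope.

    The key names ("type", "url", "headers") are an assumed layout,
    not a format confirmed by this page.
    """
    if scope not in SERVER_URLS:
        raise ValueError(f"unknown scope: {scope!r}")
    return {
        "type": "http",
        "url": SERVER_URLS[scope],
        "headers": {"Authorization": f"Bearer {api_key}"},
    }


def build_config(scope: str, api_key: str) -> dict:
    # "mcpServers" as the top-level key is also an assumption.
    return {"mcpServers": {f"mixpeek-{scope}": build_server_entry(scope, api_key)}}


if __name__ == "__main__":
    print(json.dumps(build_config("retrieval", "YOUR_API_KEY"), indent=2))
```

Keeping the scope-to-URL mapping in one place makes it easy to switch from a scoped server to the full platform without retyping URLs.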
Stdio (local development)
Run the server locally with an optional --scope flag:
Ingestion Server — 18 tools
Manage buckets, collections, and documents. Use this server when building data ingestion pipelines.
Bucket Management (6 tools)
Buckets store your raw files (videos, images, documents) before processing.
| Tool | Description |
|---|---|
| create_bucket | Create a new bucket for file storage and processing |
| list_buckets | List all buckets in a namespace |
| get_bucket | Get details of a specific bucket |
| update_bucket | Update bucket configuration |
| delete_bucket | Delete a bucket and all its objects |
| upload_object | Upload an object (file) to a bucket from a URL |
Collection Management (7 tools)
Collections define how your data is processed: which feature extractor runs and which embeddings are generated. Each collection has exactly one feature extractor.
| Tool | Description |
|---|---|
| create_collection | Create a collection with a feature extractor, source config, and optional taxonomy/cluster/alert applications |
| list_collections | List all collections in a namespace |
| get_collection | Get collection details |
| update_collection | Update collection configuration |
| clone_collection | Clone an existing collection with optional overrides |
| trigger_collection | Trigger the processing pipeline on bucket objects |
| delete_collection | Delete a collection and all its documents |
Available feature extractors:
| Extractor | Description |
|---|---|
| text_extractor | Text embeddings (multilingual E5) |
| image_extractor | Image embeddings (CLIP, SigLIP) |
| multimodal_extractor | Text + image joint embeddings |
| face_identity_extractor | Face detection and recognition |
| document_graph_extractor | Document structure extraction |
| sentiment_classifier | Sentiment analysis |
| web_scraper | Web page content extraction |
| course_content_extractor | Video/course content processing |
Document Management (5 tools)
Documents are the processed records stored in your namespace with extracted features and embeddings.
| Tool | Description |
|---|---|
| create_document | Create a new document in a collection |
| list_documents | List documents in a collection with filters |
| get_document | Get a specific document by ID |
| update_document | Update a document’s data |
| delete_document | Delete a document from a collection |
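To show how the ingestion tools might fit together, here is a minimal sketch that builds MCP `tools/call` requests (JSON-RPC 2.0) for a bucket-to-documents flow. The tool names come from the tables above; every argument name and value (`bucket_name`, `bucket_id`, `url`, `collection_name`, `feature_extractor`, `collection_id`) is a hypothetical placeholder, since this page does not list tool parameters. In practice your MCP client builds these requests for you.

```python
import json
from itertools import count

_ids = count(1)  # simple incrementing JSON-RPC request ids


def tool_call(name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# One plausible ordering: store raw files, define processing, run it, read results.
# All argument names and values below are hypothetical placeholders.
ingestion_flow = [
    tool_call("create_bucket", {"bucket_name": "product-videos"}),
    tool_call("upload_object", {"bucket_id": "BUCKET_ID", "url": "https://example.com/demo.mp4"}),
    tool_call(
        "create_collection",
        {"collection_name": "video-frames", "feature_extractor": "multimodal_extractor"},
    ),
    tool_call("trigger_collection", {"collection_id": "COLLECTION_ID"}),
    tool_call("list_documents", {"collection_id": "COLLECTION_ID"}),
]

if __name__ == "__main__":
    for request in ingestion_flow:
        print(json.dumps(request))
```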
Retrieval Server — 11 tools
Search and query your data. Use this server for RAG applications, search UIs, and agent workflows.
Retriever Management (7 tools)
Retrievers are multi-stage search pipelines. Chain stages together to search, filter, rerank, and enrich results.
| Tool | Description |
|---|---|
| create_retriever | Create a multi-stage search pipeline |
| list_retrievers | List all retrievers in a namespace |
| get_retriever | Get retriever configuration and all stages |
| update_retriever | Update retriever metadata (name, description, tags) |
| clone_retriever | Clone an existing retriever with optional modifications |
| execute_retriever | Execute a retriever with inputs and get search results |
| delete_retriever | Delete a retriever |
29+ available stages across 5 categories:
| Category | Stage IDs |
|---|---|
| Filter | feature_search, attribute_filter, llm_filter, query_expand, agent_search |
| Sort | sort_relevance, sort_attribute, rerank, mmr, score_normalize |
| Reduce | limit, group_by, aggregate, summarize, sample, deduplicate, cluster |
| Apply | json_transform, api_call, web_search, sql_lookup, cross_compare, unwind, rag_prepare |
| Enrich | llm_enrich, document_enrich, taxonomy_enrich, code_execution, web_scrape |
Stages support template variables: {{INPUT.field}}, {{DOC.field}}, {{STAGE.field}}, {{CONTEXT.field}}.
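To make the template variable syntax concrete, the following sketch substitutes `{{INPUT.field}}`-style placeholders from plain dictionaries. It only illustrates the placeholder format; how Mixpeek actually resolves these variables (nesting, defaults, escaping) is not described on this page, and the `render` helper and the sample values are hypothetical.

```python
import re

# Matches {{NAMESPACE.field}} for the four namespaces named above.
_PLACEHOLDER = re.compile(r"\{\{\s*(INPUT|DOC|STAGE|CONTEXT)\.(\w+)\s*\}\}")


def render(template: str, scopes: dict) -> str:
    """Replace each placeholder with scopes[NAMESPACE][field].

    Raises KeyError if a namespace or field is missing.
    """

    def lookup(match: re.Match) -> str:
        namespace, field = match.group(1), match.group(2)
        return str(scopes[namespace][field])

    return _PLACEHOLDER.sub(lookup, template)


# Hypothetical values for each namespace.
scopes = {
    "INPUT": {"query": "red sneakers"},
    "DOC": {"title": "Trail Runner 2"},
    "STAGE": {"name": "rerank"},
    "CONTEXT": {"namespace_id": "ns_demo"},
}

print(render("Is '{{DOC.title}}' relevant to '{{INPUT.query}}'?", scopes))
# prints: Is 'Trail Runner 2' relevant to 'red sneakers'?
```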
Agent & Conversation (3 tools)
Conversational AI sessions with retriever-backed responses.
| Tool | Description |
|---|---|
| create_agent_session | Create a new conversational agent session |
| send_agent_message | Send a message to an agent session and get a response |
| get_agent_history | Get conversation history for an agent session |
Search (1 tool)
| Tool | Description |
|---|---|
| search_namespace | Search across all resources in a namespace (buckets, collections, retrievers, etc.) |
Admin Server — 14 tools
Manage namespaces, taxonomies, and clusters. Use this server for platform administration and data enrichment.
Namespace Management (5 tools)
Namespaces are isolated workspaces. Each namespace maps to its own vector collection in Qdrant.
| Tool | Description |
|---|---|
| create_namespace | Create a new workspace for organizing collections and resources |
| list_namespaces | List all namespaces in your organization |
| get_namespace | Get namespace details by ID or name |
| update_namespace | Update namespace configuration |
| delete_namespace | Delete a namespace and all its resources |
Taxonomy Management (5 tools)
Taxonomies are hierarchical classification systems you can apply to documents.
| Tool | Description |
|---|---|
| create_taxonomy | Create a hierarchical classification taxonomy |
| list_taxonomies | List all taxonomies |
| get_taxonomy | Get taxonomy details |
| execute_taxonomy | Apply taxonomy classification to document data |
| delete_taxonomy | Delete a taxonomy |
Cluster Management (4 tools)
Clusters group similar documents together for discovery and organization.
| Tool | Description |
|---|---|
| create_cluster | Create a document clustering configuration |
| list_clusters | List all clusters |
| execute_cluster | Execute the clustering algorithm on a collection |
| delete_cluster | Delete a cluster configuration |
Retriever Server
The Retriever MCP server is a lightweight server scoped to a single retriever. It reads your retriever’s input_schema at startup and generates a typed search tool whose parameters match it exactly, so the AI assistant knows which inputs are available without guesswork.
Tools
| Tool | Description |
|---|---|
| search | Execute the retriever. Parameters are generated from the retriever’s input_schema, including the correct field names, types, required flags, enums, and descriptions. Pagination parameters (page, page_size) are added automatically. |
| describe | Returns structured metadata: retriever ID, name, collections, input fields, and stage configuration. |
| explain | Returns a human-readable explanation of the pipeline: what each stage does, in what order. |
How Dynamic Schema Works
When the server starts, it fetches your retriever’s configuration and converts its input_schema into a JSON Schema for the search tool. For example, if your retriever’s input_schema defines a required string field query and an optional enum field category, the search tool will expose query (required string) and category (optional enum) as typed parameters, plus page and page_size for pagination.
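The conversion can be sketched roughly as follows. The shape of `input_schema` used here (a mapping from field name to `type`, `required`, `enum`, `description`) is an assumption made for illustration, and `build_search_schema` is a hypothetical helper, not the server's actual code. The sketch also applies the documented pagination rule: `page` and `page_size` are added, and renamed to `_pagination_page` and `_pagination_page_size` when the retriever already uses either name.

```python
def build_search_schema(input_schema: dict) -> dict:
    """Convert a (hypothetically shaped) input_schema into a JSON Schema object."""
    properties = {}
    required = []
    for name, spec in input_schema.items():
        prop = {"type": spec.get("type", "string")}
        if "enum" in spec:
            prop["enum"] = list(spec["enum"])
        if "description" in spec:
            prop["description"] = spec["description"]
        properties[name] = prop
        if spec.get("required", False):
            required.append(name)

    # Rename both pagination parameters if either name is already taken.
    clash = "page" in input_schema or "page_size" in input_schema
    page_key = "_pagination_page" if clash else "page"
    size_key = "_pagination_page_size" if clash else "page_size"
    properties[page_key] = {"type": "integer", "minimum": 1}
    properties[size_key] = {"type": "integer", "minimum": 1}

    return {"type": "object", "properties": properties, "required": required}


# Hypothetical retriever inputs matching the example in the text.
example = {
    "query": {"type": "string", "required": True, "description": "Search text"},
    "category": {"type": "string", "enum": ["shoes", "bags"]},
}

schema = build_search_schema(example)
print(sorted(schema["properties"]))  # ['category', 'page', 'page_size', 'query']
print(schema["required"])            # ['query']
```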
If your retriever’s input_schema has a field named page or page_size, the pagination parameters are automatically renamed to _pagination_page and _pagination_page_size to avoid conflicts.
Setup
- Claude Desktop (stdio)
- HTTP (deployed)
- Environment Variables
mixpeek-mcp-retriever CLI installed:
Example Conversation
Once connected, you can interact naturally:
Authentication & Security
All MCP servers use your existing Mixpeek API key with the same permissions as the REST API.
- HTTP transport: Pass the API key in the Authorization: Bearer header. The server extracts it and injects it into every tool call.
- Stdio transport: Set the MIXPEEK_API_KEY environment variable or pass api_key in tool arguments.
- Same RBAC permissions as the REST API
- Rate limiting per organization
- Audit logging for all operations
- TLS encryption on the hosted server
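The two transports differ only in where the key comes from. The sketch below shows one way a server could resolve it: parse the Bearer header for HTTP, otherwise fall back to the `api_key` tool argument or the `MIXPEEK_API_KEY` environment variable. The helper names and the precedence between the argument and the environment variable are assumptions; this page does not state which one wins.

```python
import os
from typing import Mapping, Optional


def extract_bearer(authorization: str) -> str:
    """Return the token from an 'Authorization: Bearer <key>' header value."""
    scheme, _, token = authorization.partition(" ")
    if scheme.lower() != "bearer" or not token.strip():
        raise ValueError("expected header value of the form 'Bearer <key>'")
    return token.strip()


def resolve_api_key(
    arguments: Mapping[str, object],
    authorization: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> str:
    """Pick the API key for a tool call.

    HTTP transport: use the Authorization header.
    Stdio transport: use api_key from the tool arguments, else MIXPEEK_API_KEY
    (this precedence is an assumption).
    """
    if authorization:
        return extract_bearer(authorization)
    env = os.environ if env is None else env
    key = arguments.get("api_key") or env.get("MIXPEEK_API_KEY")
    if not key:
        raise RuntimeError("no API key: set MIXPEEK_API_KEY or pass api_key")
    return str(key)


if __name__ == "__main__":
    print(resolve_api_key({}, authorization="Bearer YOUR_API_KEY"))
```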
Architecture
/ingestion, /retrieval, /admin) while the full server handles root-level requests.
Troubleshooting
Claude can't connect to the MCP server
- Verify the URL is correct (e.g. https://mcp.mixpeek.com/ingestion/mcp)
- Check that the Authorization header format is Bearer YOUR_API_KEY
- Restart Claude Desktop or Claude Code after changing config
Tools return 'Unauthorized' or 'Invalid API key'
Tool not found (404)
- You may be calling a tool on the wrong scoped server (e.g. execute_retriever on /ingestion)
- Check GET /tools on the scoped endpoint to see available tools
- Use the full server (/) if you need all tools
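A quick way to rule out the wrong-server case offline is to look the tool up in a local copy of the tool tables from this page. The sketch below does that; the `scopes_for` and `diagnose` helpers are hypothetical, and the lists are transcribed from the tables above rather than fetched from the server, so they can drift from what GET /tools reports.

```python
# Tool names transcribed from the tool tables on this page.
SCOPED_TOOLS = {
    "ingestion": {
        "create_bucket", "list_buckets", "get_bucket", "update_bucket",
        "delete_bucket", "upload_object",
        "create_collection", "list_collections", "get_collection",
        "update_collection", "clone_collection", "trigger_collection",
        "delete_collection",
        "create_document", "list_documents", "get_document",
        "update_document", "delete_document",
    },
    "retrieval": {
        "create_retriever", "list_retrievers", "get_retriever",
        "update_retriever", "clone_retriever", "execute_retriever",
        "delete_retriever",
        "create_agent_session", "send_agent_message", "get_agent_history",
        "search_namespace",
    },
    "admin": {
        "create_namespace", "list_namespaces", "get_namespace",
        "update_namespace", "delete_namespace",
        "create_taxonomy", "list_taxonomies", "get_taxonomy",
        "execute_taxonomy", "delete_taxonomy",
        "create_cluster", "list_clusters", "execute_cluster", "delete_cluster",
    },
}


def scopes_for(tool: str) -> list[str]:
    """Return the scoped servers that list the given tool."""
    return [scope for scope, tools in SCOPED_TOOLS.items() if tool in tools]


def diagnose(scope: str, tool: str) -> str:
    """Explain why calling `tool` on `scope` might return 404."""
    if tool in SCOPED_TOOLS.get(scope, set()):
        return f"{tool} is listed on /{scope}; check the server's own tool list next"
    matches = scopes_for(tool)
    if matches:
        return f"{tool} is not on /{scope}; use /{matches[0]} or the full server"
    return f"{tool} is not in any tool table on this page"


if __name__ == "__main__":
    print(diagnose("ingestion", "execute_retriever"))
```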
Retriever Server fails to start
- Ensure --retriever-id and --namespace-id are correct
- Verify the API key has access to that namespace
- Check that the retriever exists: GET /v1/retrievers/{id}
Search returns empty results
- Confirm your collection has processed documents (not just uploaded files)
- Check that the retriever’s feature_uri matches your collection’s extractor
- Try a broader query or remove optional filters
Slow response times
- Large file uploads depend on file size and network
- Multi-stage retrievers with LLM enrichment or reranking take more time
- Check status.mixpeek.com for service issues

