GET /v1/retrievers/{retriever_id}/evaluations/{evaluation_id}
Get evaluation results
curl --request GET \
  --url https://api.mixpeek.com/v1/retrievers/{retriever_id}/evaluations/{evaluation_id} \
  --header 'Authorization: Bearer <token>'
{
  "evaluation_id": "<string>",
  "retriever_id": "<string>",
  "dataset_id": "<string>",
  "dataset_name": "<string>",
  "config": {
    "k_values": [
      1,
      5,
      10,
      20
    ],
    "metrics": [
      "precision",
      "recall",
      "f1",
      "map",
      "ndcg",
      "mrr"
    ]
  },
  "status": "completed",
  "created_at": "2023-11-07T05:31:56Z",
  "updated_at": "2023-11-07T05:31:56Z",
  "namespace_id": "<string>",
  "internal_id": "<string>",
  "query_count": 123,
  "completed_at": "2023-11-07T05:31:56Z",
  "overall_metrics": {},
  "metrics_by_k": {},
  "total_queries": 123,
  "evaluated_queries": 123,
  "skipped_queries": 123,
  "error_message": "<string>"
}
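The same request can be issued from code. The following is a minimal Python sketch using only the standard library. The helper names `build_evaluation_request` and `get_evaluation` are illustrative and not part of any official SDK, the 30-second timeout is an arbitrary choice, and the token and IDs are placeholders you supply.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.mixpeek.com"  # host taken from the curl example above


def build_evaluation_request(retriever_id: str, evaluation_id: str, token: str) -> urllib.request.Request:
    """Build (but do not send) the GET request for one evaluation record."""
    # Percent-encode the path parameters so unusual IDs cannot break the URL.
    rid = urllib.parse.quote(retriever_id, safe="")
    eid = urllib.parse.quote(evaluation_id, safe="")
    url = f"{BASE_URL}/v1/retrievers/{rid}/evaluations/{eid}"
    return urllib.request.Request(
        url,
        method="GET",
        headers={"Authorization": f"Bearer {token}"},
    )


def get_evaluation(retriever_id: str, evaluation_id: str, token: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON body (hypothetical helper)."""
    request = build_evaluation_request(retriever_id, evaluation_id, token)
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Splitting request construction from sending keeps the URL and header logic checkable without network access.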

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

retriever_id
string
required
evaluation_id
string
required

Response

Successful Response

Complete evaluation record with results.

evaluation_id
string
required

Unique evaluation identifier

retriever_id
string
required

ID of retriever being evaluated

dataset_id
string
required

ID of dataset used for evaluation

dataset_name
string
required

Name of dataset

config
EvaluationConfig · object
required

Evaluation configuration

status
enum<string>
required

Current status

Available options:
pending,
in_progress,
completed,
failed
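
Because status moves through pending and in_progress before reaching completed or failed, a client may want to re-fetch the record until it settles. The sketch below assumes that polling this endpoint is an acceptable way to wait, which this page does not state. `wait_for_evaluation` is a hypothetical helper, and the interval and attempt limit are arbitrary defaults.

```python
import time
from typing import Callable

# completed and failed are treated as final; pending and in_progress are not.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_evaluation(
    fetch: Callable[[], dict],
    interval_seconds: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch() until the record reaches a terminal status, then return it."""
    for attempt in range(max_attempts):
        record = fetch()
        if record["status"] in TERMINAL_STATUSES:
            return record
        # Pause between polls, but not after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_seconds)
    raise TimeoutError(f"evaluation still not finished after {max_attempts} polls")
```

Here `fetch` is any zero-argument callable that returns the decoded response body, so the loop can be exercised with a stub before it is pointed at the live API.
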
created_at
string<date-time>
required

When evaluation was created

updated_at
string<date-time>
required

Last update timestamp

namespace_id
string
required

Namespace ID

internal_id
string
required

Internal organization ID

query_count
integer
required

Number of queries evaluated

completed_at
string<date-time> | null

When evaluation completed

overall_metrics
Overall Metrics · object | null

Aggregated metrics across all queries

metrics_by_k
Metrics By K · object | null

Metrics broken down by K value (keys are string K values like '5', '10', '20')
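
Since the keys of metrics_by_k are strings, sorting them directly gives '10' before '5'. The sketch below orders them numerically and extracts one metric across K values. It assumes each entry maps metric names to numbers, for example {"5": {"precision": 0.4}}; the inner shape is not documented on this page, so treat that as a guess. `metric_curve` is a hypothetical helper.

```python
from typing import List, Optional, Tuple


def metric_curve(metrics_by_k: Optional[dict], metric: str) -> List[Tuple[int, float]]:
    """Return (k, value) pairs for one metric, ordered by numeric K."""
    if not metrics_by_k:
        return []  # the field is nullable, and may be empty
    curve = []
    # Sort with int() so that "10" comes after "5"; plain string sorting would not.
    for key in sorted(metrics_by_k, key=int):
        values = metrics_by_k[key]
        # ASSUMPTION: each entry is a mapping of metric name to number.
        if metric in values:
            curve.append((int(key), values[metric]))
    return curve
```
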

total_queries
integer | null

Total queries in the dataset for this run (= evaluated_queries + skipped_queries).

evaluated_queries
integer | null

Number of queries that produced metrics. Can be less than total_queries when queries with empty or failing input were skipped instead of aborting the run.

skipped_queries
integer | null

Number of queries skipped during evaluation because of an empty query_input or a per-query execution failure. Skipped queries do not fail the evaluation as a whole.
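
The three count fields can be cross-checked on the client using the relation given for total_queries (evaluated_queries + skipped_queries). The following sketch is one way to do that while tolerating null values. `summarize_query_counts` and the keys of the dictionary it returns are illustrative names, not part of the API.

```python
from typing import Optional


def summarize_query_counts(record: dict) -> Optional[dict]:
    """Return the query counts plus a skipped fraction, or None if any count is missing."""
    total = record.get("total_queries")
    evaluated = record.get("evaluated_queries")
    skipped = record.get("skipped_queries")
    if total is None or evaluated is None or skipped is None:
        return None  # all three fields are nullable
    return {
        "total": total,
        "evaluated": evaluated,
        "skipped": skipped,
        # Documented relation: total_queries = evaluated_queries + skipped_queries.
        "consistent": total == evaluated + skipped,
        # Guard against division by zero for an empty dataset.
        "skipped_fraction": skipped / total if total else 0.0,
    }
```

A high skipped fraction suggests the reported metrics cover only part of the dataset, which may be worth surfacing alongside the scores.
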

error_message
string | null

Error message, set when the evaluation failed