Documentation Index Fetch the complete documentation index at: https://docs.mixpeek.com/docs/llms.txt
Use this file to discover all available pages before exploring further.
The Score Normalize stage rescales document scores using statistical normalization methods, enabling meaningful comparison across different scoring sources and consistent downstream thresholding.
Stage Category : SORT (Rescales scores)Transformation : N documents → N documents (same order, normalized scores)
When to Use
Use Case Description Hybrid search fusion Normalize text and vector scores before combining Score thresholding Set consistent cutoffs across different retrievers Cross-model comparison Make scores from different models comparable Probability ranking Convert scores to probability distribution Multi-stage pipelines Normalize between reranking stages
When NOT to Use
Scenario Recommended Alternative Reordering by relevance sort_relevanceReranking with cross-encoders rerankFiltering by score threshold attribute_filter on score fieldSingle scoring source Scores are already comparable
Parameters
Parameter Type Default Description methodstring min_maxNormalization method: min_max, z_score, softmax, l2 score_fieldstring scoreField containing the score to normalize output_fieldstring nullWrite normalized score to this field (preserves original) min_valuefloat nullCustom minimum for min_max (uses actual min if null) max_valuefloat nullCustom maximum for min_max (uses actual max if null)
Normalization Methods
Method Formula Output Range Best For min_max(x - min) / (max - min) [0, 1] Bounded comparison z_score(x - mean) / std (-∞, +∞) Statistical thresholding softmaxexp(x) / Σexp (0, 1), sum=1 Probability distribution l2x / ‖x‖₂ [-1, 1] Geometric comparison
Configuration Examples
Min-Max to [0,1]
Z-Score with Separate Output
Softmax Probability
Custom Range Bounds
{
"stage_type" : "sort" ,
"stage_id" : "score_normalize" ,
"parameters" : {
"method" : "min_max" ,
"score_field" : "score"
}
}
Use output_field to preserve the original score alongside the normalized value. This is useful for debugging or when you need both raw and normalized scores downstream.
Metric Value Latency < 1ms Memory O(N) for score array Cost Free Complexity O(N) (two passes: stats + normalize)
Common Pipeline Patterns
Hybrid Search Fusion
[
{
"stage_type" : "filter" ,
"stage_id" : "feature_search" ,
"parameters" : {
"feature_uris" : [{ "input" : { "text" : "{{INPUT.query}}" }, "uri" : "mixpeek://text_extractor@v1/embedding" }],
"limit" : 50
}
},
{
"stage_type" : "sort" ,
"stage_id" : "score_normalize" ,
"parameters" : {
"method" : "min_max" ,
"score_field" : "score"
}
},
{
"stage_type" : "sort" ,
"stage_id" : "rerank" ,
"parameters" : {
"inference_name" : "baai_bge_reranker_v2_m3" ,
"query" : "{{INPUT.query}}" ,
"document_field" : "content"
}
}
]
Score Thresholding After Normalization
[
{
"stage_type" : "filter" ,
"stage_id" : "feature_search" ,
"parameters" : {
"feature_uris" : [{ "input" : { "text" : "{{INPUT.query}}" }, "uri" : "mixpeek://text_extractor@v1/embedding" }],
"limit" : 100
}
},
{
"stage_type" : "sort" ,
"stage_id" : "score_normalize" ,
"parameters" : {
"method" : "min_max"
}
},
{
"stage_type" : "filter" ,
"stage_id" : "attribute_filter" ,
"parameters" : {
"AND" : [
{ "field" : "score" , "operator" : "gte" , "value" : 0.5 }
]
}
}
]
Error Handling
Error Behavior Single document min_max returns 1.0; z_score returns 0.0 All same scores min_max returns 1.0 for all; z_score returns 0.0 for all Score field missing Treated as 0.0 Non-numeric score Treated as 0.0