Mixpeek Logo
    Reduce

    Deduplicate

    Remove duplicate documents based on field values

    Note: This playground provides simulated output to showcase functionality. No data is processed on our servers. Use this demo to explore the stage's configuration options before integrating it into your retriever pipeline.

    Configuration

    array
    Required

    REQUIRED. Fields to use for deduplication.

    string

    Which duplicate to keep: 'first' or 'last'.

    number | null

    OPTIONAL. For fuzzy deduplication, similarity threshold (0-1).

    Output

    No output yet

    Configure the stage parameters and click "Run Stage" to see the simulated output.