NEWVectors or files. Pick a path.Start →
    Models/Embeddings/lightonai/Reason-ModernColBERT
    HFText EmbeddingsCC-BY-NC-4.0

    Reason-ModernColBERT

    by lightonai

    Late-interaction retriever trained for reasoning-intensive search queries

    9.1Kdl/month
    150M classparams
    Identifiers
    Model ID
    lightonai/Reason-ModernColBERT
    Feature URI
    mixpeek://text_extractor@v1/lighton_reason_moderncolbert_v1

    Overview

    Reason-ModernColBERT is a PyLate ColBERT model fine-tuned from LightOn's GTE-ModernColBERT-v1 on the ReasonIR dataset. It targets retrieval problems where the query is not a short keyword string but a reasoning-heavy prompt that requires matching evidence across paragraphs.

    On Mixpeek, this makes it a useful text retrieval companion for agents. After visual, audio, or document extractors produce text evidence, Reason-ModernColBERT can retrieve passages that match an agent's intermediate reasoning state with token-level MaxSim scoring instead of collapsing each document into one dense vector.

    Architecture

    ModernBERT-based late-interaction retriever trained with PyLate. It maps queries and passages to sequences of 128-dimensional token vectors and scores them with MaxSim. The model supports 8,192-token documents and 128-token queries according to the model card.

    Mixpeek SDK Integration

    import { Mixpeek } from "mixpeek";
    const mx = new Mixpeek({ apiKey: "API_KEY" });
    await mx.collections.ingest({
    collection_id: "agent-evidence",
    source: { url: "s3://knowledge-base/extracted-text/" },
    feature_extractors: [{
    feature: "text_embeddings",
    model: "lightonai/Reason-ModernColBERT",
    params: {
    interaction: "late",
    max_document_tokens: 8192
    }
    }]
    });

    Capabilities

    • Reasoning-intensive retrieval over long passages
    • Late-interaction token matching with MaxSim
    • 8K-token document support
    • Useful for agent queries that include context, constraints, and partial findings
    • Fine-tuned on ReasonIR data from GTE-ModernColBERT-v1

    Use Cases on Mixpeek

    Retrieve transcript passages after an agent forms a multi-step hypothesis
    Search document evidence with long, constraint-heavy queries
    Rerank OCR and caption text generated from multimodal pipelines
    Build retrieval tools for agents that search while they reason

    Benchmarks

    DatasetMetricScoreSource
    BRIGHTNDCG@10Outperforms models up to 7BLightOn model card
    Stack Exchange splitsNDCG@10+2.5 average over ReasonIR-8BLightOn model card

    Performance

    Input Size8,192-token documents, 128-token queries
    Embedding Dim128 per token
    GPU LatencyInput dependent
    GPU ThroughputBatch dependent
    GPU MemoryModel dependent

    Late interaction increases index size relative to single-vector dense retrieval

    Specification

    FrameworkHF
    Organizationlightonai
    FeatureText Embeddings
    Output1024-dim vector
    Modalitiesdocument, audio
    RetrieverText Similarity
    Parameters150M class
    LicenseCC-BY-NC-4.0
    Downloads/mo9.1K

    Research Paper

    Reason-ModernColBERT

    arxiv.org

    Build a pipeline with Reason-ModernColBERT

    Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.

    Open Studio