NEWManaged multimodal retrieval.Explore platform →
    Models/Embeddings/jinaai/jina-embeddings-v5-text-nano
    HFText EmbeddingsCC-BY-NC-4.0

    jina-embeddings-v5-text-nano

    by jinaai

    Smallest high-quality multilingual text embedding at 239M parameters

    451Kdl/month
    239Mparams
    Identifiers
    Model ID
    jinaai/jina-embeddings-v5-text-nano
    Feature URI
    mixpeek://text_extractor@v1/jina_embeddings_v5_nano_v1

    Overview

    Jina Embeddings v5 Text Nano is a 239M-parameter multilingual text embedding model built on the EuroBERT-210M backbone. It achieves 71.0 on MTEB English v2 — remarkably close to the 677M v5-text-small (71.7) at one-third the size. Trained via embedding distillation from Qwen3-Embedding-4B with task-specific contrastive losses, it retains quality under aggressive dimension truncation and binary quantization.

    On Mixpeek, jina-embeddings-v5-text-nano is the optimal choice for latency-critical and edge deployments where every millisecond counts. Its Matryoshka support (768 down to 32 dimensions) and robust quantization make it ideal for high-throughput text search at minimal compute cost.

    Architecture

    EuroBERT-210M backbone with last-token pooling. 239M parameters. Four task-specific LoRA adapters (retrieval, text-matching, clustering, classification). 8192-token context length. Matryoshka truncation from 768 to 32 dimensions.

    Mixpeek SDK Integration

    from mixpeek import Mixpeek
    mixpeek = Mixpeek(api_key="YOUR_API_KEY")
    mixpeek.ingest.documents(
    collection="knowledge_base",
    source={"type": "s3", "bucket": "docs"},
    pipeline={
    "embedding": {
    "model": "mixpeek://text_extractor@v1/jina_embeddings_v5_text_nano"
    }
    }
    )

    Capabilities

    • 71.0 avg on MTEB English v2 (best under 300M multilingual)
    • 768-dimensional embeddings with Matryoshka truncation to 32-dim
    • 8192 token context length
    • Multilingual support across 100+ languages
    • Robust under binary quantization for edge deployment

    Use Cases on Mixpeek

    Edge-deployed semantic search with sub-millisecond latency requirements
    High-throughput text indexing pipelines processing millions of documents
    Mobile and IoT applications requiring compact embedding models
    Cost-efficient multilingual search replacing larger embedding models

    Benchmarks

    DatasetMetricScoreSource
    MTEB English v2 (avg)Score71.0Model card
    MMTEB (multilingual)Score65.5Model card

    Performance

    Input SizeVariable
    GPU Latency~2ms per passage (A100)
    GPU Throughput~5000 passages/sec (A100, batch 256)
    GPU Memory~0.6 GB

    Specification

    FrameworkHF
    Organizationjinaai
    FeatureText Embeddings
    Output1024-dim vector
    Modalitiesdocument, audio
    RetrieverText Similarity
    Parameters239M
    LicenseCC-BY-NC-4.0
    Downloads/mo451K

    Research Paper

    Model paper or technical report

    arxiv.org

    Build a pipeline with jina-embeddings-v5-text-nano

    Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.

    Open Studio