embeddinggemma-300m
by google
Google's efficient multilingual text embedding -- 100+ languages in 300M parameters
google/embeddinggemma-300mmixpeek://text_extractor@v1/google_embeddinggemma_300m_v1Overview
EmbeddingGemma is Google's compact text embedding model based on the Gemma 3 architecture, designed for deployment on edge devices while maintaining strong multilingual performance. At 308M parameters, it runs in under 200MB of RAM when quantized and achieves sub-22ms inference on EdgeTPU.
Despite its small size, it ranks as the highest-performing open multilingual text embedding model under 500M parameters on MTEB. It supports Matryoshka representations (768 and 128 dimensions) for flexible memory/quality tradeoffs. On Mixpeek, it provides a lightweight embedding option for high-throughput text indexing where GPU resources are limited.
Architecture
Gemma 3 decoder backbone, 308M parameters. 2048 token context. Matryoshka dimensions: 768 (full) and 128 (compressed). Distilled from a larger teacher model. Optimized for TPU/mobile inference.
Mixpeek SDK Integration
import { Mixpeek } from "mixpeek";
const mx = new Mixpeek({ apiKey: "API_KEY" });
// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
namespace_id: "my-namespace",
collection_name: "my-collection",
source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
feature_extractor: {
feature_extractor_name: "text_embedding",
version: "v1",
parameters: { model_id: "google/embeddinggemma-300m" },
},
});Capabilities
- 100+ language support
- Matryoshka dimension reduction (768/128)
- Sub-200MB quantized footprint
- EdgeTPU and mobile-optimized
- 2048 token context
Use Cases on Mixpeek
Benchmarks
| Dataset | Metric | Score | Source |
|---|---|---|---|
| MTEB (multilingual avg) | Score | 64.1 | Google, 2025 -- Model Card |
Performance
Common Pipeline Companions
Explore on Mixpeek
Compare alternatives in this category
Hand-picked tools & platforms compared
Deep-dive technical guide
See how Mixpeek runs models as extractors
Store & search embeddings at scale
Usage-based pricing for pipelines
Compare models, APIs & infrastructure
Specification
Research Paper
EmbeddingGemma
arxiv.orgBuild a pipeline with embeddinggemma-300m
Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.
Open Studio