NEWManaged multimodal retrieval.Explore platform →
    Models/Speech & Audio/CohereLabs/cohere-transcribe-03-2026
    NeMoTranscriptionApache-2.0

    cohere-transcribe-03-2026

    by CohereLabs

    #1 on Open ASR Leaderboard with 14-language support

    286Kdl/month
    2Bparams
    Identifiers
    Model ID
    CohereLabs/cohere-transcribe-03-2026
    Feature URI
    mixpeek://transcription@v1/cohere_transcribe_03_v1

    Overview

    Cohere Transcribe is a 2B-parameter automatic speech recognition model that ranks #1 on the Open ASR Leaderboard for English. Trained on 500K hours of audio data, it delivers 3x faster real-time processing compared to models of similar accuracy. The model supports 14 languages with strong multilingual performance.

    For multimodal search pipelines, accurate transcription is foundational -- every word in the transcript becomes searchable text. Higher transcription accuracy directly translates to better full-text search over audio and video content.

    Architecture

    Encoder-decoder architecture optimized for streaming and batch ASR. 2B parameters trained on 500K hours of diverse audio. Supports NeMo framework for enterprise deployment.

    Mixpeek SDK Integration

    import { Mixpeek } from "mixpeek";
    const mx = new Mixpeek({ apiKey: "API_KEY" });
    await mx.collections.ingest({
    collection_id: "my-collection",
    source: { url: "https://example.com/podcast.mp3" },
    feature_extractors: [{
    feature: "transcription",
    model: "CohereLabs/cohere-transcribe-03-2026"
    }]
    });

    Capabilities

    • #1 on Open ASR Leaderboard (English)
    • 14 language support with strong multilingual accuracy
    • 3x faster than comparable accuracy models
    • Apache-2.0 license for commercial use
    • NeMo framework support for enterprise deployment

    Use Cases on Mixpeek

    Video transcription: convert spoken content to searchable text
    Podcast indexing: make every spoken word findable
    Meeting recording search: extract action items and topics from meeting audio
    Multilingual content: transcribe content across 14 languages for unified search

    Specification

    FrameworkNeMo
    OrganizationCohereLabs
    FeatureTranscription
    Outputtext + timestamps
    Modalitiesvideo, audio
    RetrieverTranscript Search
    Parameters2B
    LicenseApache-2.0
    Downloads/mo286K

    Research Paper

    Cohere Transcribe

    arxiv.org

    Build a pipeline with cohere-transcribe-03-2026

    Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.

    Open Studio