
    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 289–300 of 300 models


    kyutai/stt-2.6b-en-trfs

    4K
    10
    transformers
    Automatic Speech Recognition

    reazon-research/japanese-wav2vec2-base-rs35kh

    3K
    2
    transformers
    Automatic Speech Recognition

    Harveenchadha/vakyansh-wav2vec2-telugu-tem-100

    3K
    transformers
    Automatic Speech Recognition

    unsloth/whisper-large-v3-turbo

    3K
    10
    transformers
    Automatic Speech Recognition

    ivrit-ai/whisper-large-v3-turbo

    3K
    9
    transformers
    Automatic Speech Recognition

    onnx-community/whisper-large-v3-turbo_timestamped

    3K
    11
    transformers.js
    Automatic Speech Recognition

    oxide-lab/whisper-large-v3-turbo-GGUF

    3K
    Automatic Speech Recognition

    ai4bharat/indic-seamless

    3K
    19
    transformers
    Automatic Speech Recognition

    kotoba-tech/kotoba-whisper-v2.1

    3K
    19
    transformers
    Automatic Speech Recognition

    steja/whisper-large-persian

    3K
    14
    transformers
    Automatic Speech Recognition

    onnx-community/granite-4.0-1b-speech-ONNX

    3K
    7
    transformers.js
    Automatic Speech Recognition

    mlx-community/whisper-base-mlx

    3K
    1
    mlx