NEWAgents can now see video via MCP.Try it now →

    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 124 of 300 models

    Automatic Speech Recognition

    pyannote/speaker-diarization-3.1

    10.2M
    1,795
    pyannote-audio
    Automatic Speech Recognition

    argmaxinc/whisperkit-coreml

    8.1M
    169
    whisperkit
    Automatic Speech Recognition

    openai/whisper-large-v3-turbo

    7.0M
    2,971
    transformers
    Automatic Speech Recognition

    openai/whisper-large-v3

    4.9M
    5,626
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-russian

    4.9M
    74
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

    3.8M
    53
    transformers
    Automatic Speech Recognition

    MahmoudAshraf/mms-300m-1130-forced-aligner

    3.7M
    84
    transformers
    Automatic Speech Recognition

    pyannote/voice-activity-detection

    2.7M
    230
    pyannote-audio
    Automatic Speech Recognition

    pyannote/speaker-diarization-community-1

    2.3M
    317
    pyannote-audio
    Automatic Speech Recognition

    openai/whisper-small

    2.0M
    550
    transformers
    Automatic Speech Recognition

    facebook/mms-1b-all

    1.9M
    196
    transformers
    Automatic Speech Recognition

    Qwen/Qwen3-ASR-1.7B

    1.8M
    764
    Automatic Speech Recognition

    openai/whisper-base

    1.7M
    266
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn

    1.5M
    132
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-polish

    1.4M
    12
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-japanese

    1.4M
    55
    transformers
    Automatic Speech Recognition

    distil-whisper/distil-large-v3

    1.3M
    376
    transformers
    Automatic Speech Recognition

    facebook/wav2vec2-base-960h

    1.2M
    396
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-tiny.en

    1.1M
    9
    ctranslate2
    Automatic Speech Recognition

    mistralai/Voxtral-Mini-4B-Realtime-2602

    1.1M
    831
    vllm
    Automatic Speech Recognition

    kresnik/wav2vec2-large-xlsr-korean

    1.0M
    55
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-tiny

    989K
    19
    ctranslate2
    Automatic Speech Recognition

    openai/whisper-tiny

    813K
    425
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-arabic

    804K
    53
    transformers
    1 / 13