NEWAgents can now see video via MCP.Try it now →

    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 4972 of 300 models

    Automatic Speech Recognition

    facebook/wav2vec2-xlsr-53-espeak-cv-ft

    363K
    49
    transformers
    Automatic Speech Recognition

    lgris/wav2vec2-large-xlsr-open-brazilian-portuguese-v2

    356K
    20
    transformers
    Automatic Speech Recognition

    freds0/distil-whisper-large-v3-ptbr

    352K
    16
    Automatic Speech Recognition

    Qwen/Qwen3-ForcedAligner-0.6B

    345K
    121
    Automatic Speech Recognition

    argmaxinc/parakeetkit-pro

    344K
    4
    whisperkit
    Automatic Speech Recognition

    pyannote/speaker-diarization-3.0

    335K
    214
    pyannote-audio
    Automatic Speech Recognition

    CohereLabs/cohere-transcribe-03-2026

    304K
    909
    transformers
    Automatic Speech Recognition

    kingabzpro/wav2vec2-large-xls-r-300m-Urdu

    301K
    13
    transformers
    Automatic Speech Recognition

    t-tech/T-one

    300K
    90
    Automatic Speech Recognition

    ibm-granite/granite-speech-3.3-2b

    279K
    53
    transformers
    Automatic Speech Recognition

    airesearch/wav2vec2-large-xlsr-53-th

    270K
    27
    transformers
    Automatic Speech Recognition

    facebook/wav2vec2-conformer-rope-large-960h-ft

    248K
    10
    transformers
    Automatic Speech Recognition

    facebook/wav2vec2-lv-60-espeak-cv-ft

    245K
    67
    transformers
    Automatic Speech Recognition

    indonesian-nlp/wav2vec2-indonesian-javanese-sundanese

    230K
    12
    transformers
    Automatic Speech Recognition

    tristayqc/my_zh_CN_asr_cv13_model

    230K
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-medium

    215K
    40
    ctranslate2
    Automatic Speech Recognition

    facebook/hubert-large-ls960-ft

    206K
    76
    transformers
    Automatic Speech Recognition

    gigant/romanian-wav2vec2

    202K
    6
    transformers
    Automatic Speech Recognition

    comodoro/wav2vec2-xls-r-300m-cs-250

    186K
    3
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-tdt-0.6b-v2

    173K
    1,465
    nemo
    Automatic Speech Recognition

    vasista22/whisper-tamil-small

    170K
    4
    transformers
    Automatic Speech Recognition

    nvidia/canary-1b-v2

    169K
    378
    nemo
    Automatic Speech Recognition

    jbetker/wav2vec2-large-robust-ft-libritts-voxpopuli

    163K
    8
    transformers
    Automatic Speech Recognition

    Vikhrmodels/Borealis

    133K
    54
    transformers
    3 / 13