NEWVectors or files. Pick a path.Start →

    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    372 models available

    Showing 2548 of 372 models

    Automatic Speech Recognition

    nvidia/parakeet-ctc-1.1b

    889K
    49
    nemo
    Automatic Speech Recognition

    Qwen/Qwen3-ASR-0.6B

    872K
    298
    Automatic Speech Recognition

    pyannote/speaker-diarization

    821K
    1,279
    pyannote-audio
    Automatic Speech Recognition

    indonesian-nlp/wav2vec2-indonesian-javanese-sundanese

    786K
    15
    transformers
    Automatic Speech Recognition

    airesearch/wav2vec2-large-xlsr-53-th

    709K
    28
    transformers
    Automatic Speech Recognition

    facebook/wav2vec2-lv-60-espeak-cv-ft

    686K
    69
    transformers
    Automatic Speech Recognition

    kresnik/wav2vec2-large-xlsr-korean

    610K
    56
    transformers
    Automatic Speech Recognition

    ibm-granite/granite-speech-4.1-2b

    605K
    127
    transformers
    Automatic Speech Recognition

    facebook/hubert-large-ls960-ft

    581K
    76
    transformers
    Automatic Speech Recognition

    microsoft/VibeVoice-ASR

    555K
    1,168
    transformers
    Automatic Speech Recognition

    AbelZimba/whisper-bemba-stt

    554K
    transformers
    Automatic Speech Recognition

    openai/whisper-medium

    531K
    284
    transformers
    Automatic Speech Recognition

    microsoft/Phi-4-multimodal-instruct

    529K
    1,601
    transformers
    Automatic Speech Recognition

    Revai/reverb-diarization-v1

    526K
    13
    pyannote-audio
    Automatic Speech Recognition

    CohereLabs/cohere-transcribe-03-2026

    509K
    977
    transformers
    Automatic Speech Recognition

    ibm-granite/granite-speech-3.3-2b

    501K
    55
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-dutch

    493K
    15
    transformers
    Automatic Speech Recognition

    Yehor/w2v-xls-r-uk

    480K
    8
    transformers
    Automatic Speech Recognition

    mlx-community/parakeet-tdt-0.6b-v2

    455K
    42
    mlx
    Automatic Speech Recognition

    Systran/faster-whisper-tiny

    443K
    21
    ctranslate2
    Automatic Speech Recognition

    Qwen/Qwen3-ForcedAligner-0.6B

    443K
    139
    Automatic Speech Recognition

    Systran/faster-whisper-medium

    441K
    46
    ctranslate2
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-german

    434K
    8
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-tdt-0.6b-v2

    365K
    1,490
    nemo
    2 / 16