NEWAgents can now see video via MCP.Try it now →

    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 265288 of 300 models

    Automatic Speech Recognition

    guillaumekln/faster-whisper-base

    5K
    10
    ctranslate2
    Automatic Speech Recognition

    aufklarer/Qwen3-ASR-1.7B-MLX-4bit

    5K
    1
    mlx
    Automatic Speech Recognition

    kotoba-tech/kotoba-whisper-v2.0

    5K
    90
    transformers
    Automatic Speech Recognition

    vinai/PhoWhisper-large

    5K
    41
    transformers
    Automatic Speech Recognition

    facebook/s2t-small-mustc-en-fr-st

    5K
    2
    transformers
    Automatic Speech Recognition

    BELLE-2/Belle-whisper-large-v3-zh-punct

    5K
    47
    transformers
    Automatic Speech Recognition

    pyannote-community/speaker-diarization-community-1

    5K
    4
    pyannote-audio
    Automatic Speech Recognition

    nvidia/diar_sortformer_4spk-v1

    5K
    137
    nemo
    Automatic Speech Recognition

    facebook/wav2vec2-large-robust-ft-swbd-300h

    5K
    20
    transformers
    Automatic Speech Recognition

    TalTechNLP/whisper-large-v3-turbo-et-verbatim

    5K
    3
    transformers
    Automatic Speech Recognition

    Oriserve/Whisper-Hindi2Hinglish-Apex

    4K
    7
    transformers
    Automatic Speech Recognition

    unsloth/whisper-large-v3

    4K
    15
    Automatic Speech Recognition

    nguyenvulebinh/wav2vec2-base-vietnamese-250h

    4K
    45
    transformers
    Automatic Speech Recognition

    Harveenchadha/vakyansh-wav2vec2-sanskrit-sam-60

    4K
    4
    transformers
    Automatic Speech Recognition

    vitouphy/wav2vec2-xls-r-300m-timit-phoneme

    4K
    32
    transformers
    Automatic Speech Recognition

    neurlang/ipa-whisper-medium

    4K
    Automatic Speech Recognition

    nvidia/stt_en_fastconformer_hybrid_large_pc

    4K
    3
    nemo
    Automatic Speech Recognition

    ivrit-ai/whisper-large-v3-ct2

    4K
    15
    ctranslate2
    Automatic Speech Recognition

    cahya/wav2vec2-large-xlsr-indonesian

    4K
    transformers
    Automatic Speech Recognition

    cstr/cohere-transcribe-03-2026-GGUF

    4K
    4
    Automatic Speech Recognition

    BarathwajAnandan/cohere-transcribe-03-2026-CoreML-6bit

    4K
    3
    coreml
    Automatic Speech Recognition

    facebook/wav2vec2-large-xlsr-53-german

    4K
    4
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-tdt_ctc-1.1b

    4K
    23
    nemo
    Automatic Speech Recognition

    ibm-granite/granite-speech-3.2-8b

    4K
    86
    transformers
    12 / 13