    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    300 models available

    Showing 25–48 of 300 models

    Automatic Speech Recognition

    pyannote/overlapped-speech-detection

    793K
    56
    pyannote-audio
    Automatic Speech Recognition

    Systran/faster-whisper-large-v3

    785K
    561
    ctranslate2
    Automatic Speech Recognition

    pyannote/speaker-diarization

    778K
    1,262
    pyannote-audio
    Automatic Speech Recognition

    openai/whisper-medium

    757K
    284
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-ctc-1.1b

    745K
    46
    nemo
    Automatic Speech Recognition

    microsoft/VibeVoice-ASR

    732K
    1,048
    transformers
    Automatic Speech Recognition

    mpoyraz/wav2vec2-xls-r-300m-cv7-turkish

    668K
    14
    transformers
    Automatic Speech Recognition

    Revai/reverb-diarization-v1

    596K
    13
    pyannote-audio
    Automatic Speech Recognition

    mlx-community/parakeet-tdt-0.6b-v3

    580K
    38
    mlx
    Automatic Speech Recognition

    Systran/faster-whisper-base

    562K
    22
    ctranslate2
    Automatic Speech Recognition

    mlx-community/parakeet-tdt-0.6b-v2

    561K
    40
    mlx
    Automatic Speech Recognition

    Yehor/w2v-xls-r-uk

    531K
    8
    transformers
    Automatic Speech Recognition

    argmaxinc/speakerkit-pro

    508K
    20
    whisperkit
    Automatic Speech Recognition

    FluidInference/parakeet-tdt-0.6b-v3-coreml

    478K
    40
    nemo
    Automatic Speech Recognition

    theainerd/Wav2Vec2-large-xlsr-hindi

    455K
    12
    transformers
    Automatic Speech Recognition

    Qwen/Qwen3-ASR-0.6B

    444K
    280
    Automatic Speech Recognition

    Khalsuu/filipino-wav2vec2-l-xls-r-300m-official

    433K
    2
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-small

    418K
    31
    ctranslate2
    Automatic Speech Recognition

    nvidia/canary-1b-flash

    408K
    271
    nemo
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-dutch

    396K
    14
    transformers
    Automatic Speech Recognition

    nvidia/parakeet-tdt-0.6b-v3

    395K
    808
    nemo
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-greek

    383K
    3
    transformers
    Automatic Speech Recognition

    microsoft/Phi-4-multimodal-instruct

    367K
    1,593
    transformers
    Automatic Speech Recognition

    NbAiLab/nb-wav2vec2-1b-nynorsk

    366K
    transformers
    Page 2 of 13