NEWVectors or files. Pick a path.Start →

    Automatic Speech Recognition Models

    Browse AI models for multimodal decomposition and recomposition pipelines — plug any model into your extractors.

    372 models available

    Showing 124 of 372 models

    Automatic Speech Recognition

    argmaxinc/whisperkit-coreml

    9.5M
    189
    whisperkit
    Automatic Speech Recognition

    pyannote/speaker-diarization-3.1

    9.2M
    2,196
    pyannote-audio
    Automatic Speech Recognition

    openai/whisper-large-v3-turbo

    8.6M
    3,067
    transformers
    Automatic Speech Recognition

    openai/whisper-large-v3

    5.4M
    5,785
    transformers
    Automatic Speech Recognition

    openai/whisper-base

    4.2M
    271
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-russian

    3.5M
    75
    transformers
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-portuguese

    2.9M
    54
    transformers
    Automatic Speech Recognition

    pyannote/speaker-diarization-community-1

    2.9M
    503
    pyannote-audio
    Automatic Speech Recognition

    MahmoudAshraf/mms-300m-1130-forced-aligner

    2.8M
    91
    transformers
    Automatic Speech Recognition

    pyannote/voice-activity-detection

    2.8M
    236
    pyannote-audio
    Automatic Speech Recognition

    openai/whisper-small

    2.4M
    564
    transformers
    Automatic Speech Recognition

    Qwen/Qwen3-ASR-1.7B

    1.7M
    861
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-polish

    1.5M
    12
    transformers
    Automatic Speech Recognition

    openai/whisper-tiny

    1.4M
    432
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-base

    1.4M
    28
    ctranslate2
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-japanese

    1.3M
    57
    transformers
    Automatic Speech Recognition

    mlx-community/parakeet-tdt-0.6b-v3

    1.3M
    44
    mlx
    Automatic Speech Recognition

    facebook/wav2vec2-base-960h

    1.2M
    398
    transformers
    Automatic Speech Recognition

    mistralai/Voxtral-Mini-4B-Realtime-2602

    1.2M
    871
    vllm
    Automatic Speech Recognition

    Systran/faster-whisper-tiny.en

    1.1M
    10
    ctranslate2
    Automatic Speech Recognition

    jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn

    1.1M
    134
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-large-v3

    1.1M
    590
    ctranslate2
    Automatic Speech Recognition

    distil-whisper/distil-large-v3

    976K
    375
    transformers
    Automatic Speech Recognition

    Systran/faster-whisper-small

    946K
    34
    ctranslate2
    1 / 16