Mixpeek Logo

    Audio AI Tools

    Tools for speech, audio processing, and transcription

    5 tools listed

    Back to Directory

    Subcategories:

    Speech-to-Text (4)Text-to-Speech (1)

    Showing 5 of 5 tools

    Deepgram logo

    Deepgram

    Speech-to-Text

    AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.

    freemium
    audio
    text

    Key features:

    Speech-to-textText-to-speechAudio intelligence+2 more
    AssemblyAI logo

    AssemblyAI

    Speech-to-Text

    AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.

    freemium
    audio
    text

    Key features:

    TranscriptionSummarizationSentiment analysis+2 more
    OpenAI Whisper logo

    OpenAI Whisper

    Speech-to-Text

    Open-source automatic speech recognition system by OpenAI trained on 680K hours of multilingual data, supporting transcription and translation.

    open-source
    open source
    audio
    text

    Key features:

    Multilingual transcriptionTranslationLanguage detection+2 more
    ElevenLabs logo

    ElevenLabs

    Text-to-Speech

    AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.

    freemium
    audio
    text

    Key features:

    Text-to-speechVoice cloningVoice design+2 more
    Speechmatics logo

    Speechmatics

    Speech-to-Text

    Enterprise speech technology providing highly accurate speech recognition across 50+ languages with real-time and batch processing.

    enterprise
    audio
    text

    Key features:

    Real-time transcriptionBatch transcriptionLanguage pack support+2 more

    Need a Multimodal Solution?

    Mixpeek processes video, image, audio, and text through unified pipelines. See how it compares to the tools listed in this directory.