Mixpeek Logo
    Login / Signup

    Audio AI Tools

    Tools for speech, audio processing, and transcription

    10 tools listed

    Back to Directory

    Subcategories:

    Speech-to-Text (5)Text-to-Speech (3)Voice Synthesis (1)Transcription (1)

    Showing 10 of 10 tools

    Deepgram logo

    Deepgram

    Speech-to-Text

    AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.

    freemium
    audio
    text

    Key features:

    Speech-to-textText-to-speechAudio intelligence+2 more
    AssemblyAI logo

    AssemblyAI

    Speech-to-Text

    AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.

    freemium
    audio
    text

    Key features:

    TranscriptionSummarizationSentiment analysis+2 more
    OpenAI Whisper logo

    OpenAI Whisper

    Speech-to-Text

    Open-source automatic speech recognition system by OpenAI trained on 680K hours of multilingual data, supporting transcription and translation.

    open-source
    open source
    audio
    text

    Key features:

    Multilingual transcriptionTranslationLanguage detection+2 more
    ElevenLabs logo

    ElevenLabs

    Text-to-Speech

    AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.

    freemium
    audio
    text

    Key features:

    Text-to-speechVoice cloningVoice design+2 more
    Speechmatics logo

    Speechmatics

    Speech-to-Text

    Enterprise speech technology providing highly accurate speech recognition across 50+ languages with real-time and batch processing.

    enterprise
    audio
    text

    Key features:

    Real-time transcriptionBatch transcriptionLanguage pack support+2 more
    Resemble AI logo

    Resemble AI

    Voice Synthesis

    AI voice generator providing real-time speech synthesis, voice cloning, and neural audio editing for creating custom synthetic voices.

    freemium
    audio
    text

    Key features:

    Voice cloningReal-time synthesisNeural audio editing+2 more
    Murf AI logo

    Murf AI

    Text-to-Speech

    AI voice generator platform offering lifelike text-to-speech with over 120 voices in 20+ languages for videos, presentations, and e-learning.

    freemium
    audio
    text

    Key features:

    Text-to-speechVoice changerAI dubbing+2 more
    Play.ht logo

    Play.ht

    Text-to-Speech

    AI voice generation platform with ultra-realistic text-to-speech, voice cloning from short samples, and an API for building voice applications.

    freemium
    audio
    text

    Key features:

    Text-to-speechVoice cloningStreaming API+2 more
    Otter.ai logo

    Otter.ai

    Transcription

    AI meeting assistant that provides real-time transcription, automated meeting notes, action items, and searchable conversation records.

    freemium
    audio
    text

    Key features:

    Real-time transcriptionMeeting summariesAction items+2 more
    Rev AI logo

    Rev AI

    Speech-to-Text

    Speech-to-text API platform providing highly accurate transcription, real-time streaming, and topic extraction for developers and enterprises.

    freemium
    audio
    text

    Key features:

    Async transcriptionReal-time streamingTopic extraction+2 more

    Need a Multimodal Solution?

    Mixpeek processes video, image, audio, and text through unified pipelines. See how it compares to the tools listed in this directory.