Audio AI Tools

Tools for speech, audio processing, and transcription

10 tools listed

Subcategories:

Speech-to-Text (5), Text-to-Speech (3), Voice Synthesis (1), Transcription (1)

Deepgram

Speech-to-Text

AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.

freemium

audio

text

Key features:

Speech-to-text, Text-to-speech, Audio intelligence, +2 more

AssemblyAI

Speech-to-Text

AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.

freemium

audio

text

Key features:

Transcription, Summarization, Sentiment analysis, +2 more

OpenAI Whisper

Speech-to-Text

Open-source automatic speech recognition system by OpenAI trained on 680K hours of multilingual data, supporting transcription and translation.

open-source

audio

text

Key features:

Multilingual transcription, Translation, Language detection, +2 more

ElevenLabs

Text-to-Speech

AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.

freemium

audio

text

Key features:

Text-to-speech, Voice cloning, Voice design, +2 more

Speechmatics

Speech-to-Text

Enterprise speech technology providing highly accurate speech recognition across 50+ languages with real-time and batch processing.

enterprise

audio

text

Key features:

Real-time transcription, Batch transcription, Language pack support, +2 more

Resemble AI

Voice Synthesis

AI voice generator providing real-time speech synthesis, voice cloning, and neural audio editing for creating custom synthetic voices.

freemium

audio

text

Key features:

Voice cloning, Real-time synthesis, Neural audio editing, +2 more

Murf AI

Text-to-Speech

AI voice generator platform offering lifelike text-to-speech with over 120 voices in 20+ languages for videos, presentations, and e-learning.

freemium

audio

text

Key features:

Text-to-speech, Voice changer, AI dubbing, +2 more

Play.ht

Text-to-Speech

AI voice generation platform with ultra-realistic text-to-speech, voice cloning from short samples, and an API for building voice applications.

freemium

audio

text

Key features:

Text-to-speech, Voice cloning, Streaming API, +2 more

Otter.ai

Transcription

AI meeting assistant that provides real-time transcription, automated meeting notes, action items, and searchable conversation records.

freemium

audio

text

Key features:

Real-time transcription, Meeting summaries, Action items, +2 more

Rev AI

Speech-to-Text

Speech-to-text API platform providing highly accurate transcription, real-time streaming, and topic extraction for developers and enterprises.

freemium

audio

text

Key features:

Async transcription, Real-time streaming, Topic extraction, +2 more

Explore Other Categories

Video AI Tools

Tools for video analysis, search, and processing

10 tools

Image AI Tools

Tools for image recognition, search, and generation

10 tools

Document AI Tools

Tools for document processing and extraction

10 tools

Multimodal AI Platforms

Platforms that handle multiple data types

10 tools

Need a Multimodal Solution?

Mixpeek processes video, image, audio, and text through unified pipelines. See how it compares to the tools listed in this directory.
