Deepgram
Speech-to-Text
AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.
AssemblyAI
Speech-to-Text
AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.
OpenAI Whisper
Speech-to-Text
Open-source automatic speech recognition system by OpenAI, trained on 680,000 hours of multilingual data, that supports both transcription and translation.
ElevenLabs
Text-to-Speech
AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.
Speechmatics
Speech-to-Text
Enterprise speech technology platform providing highly accurate speech recognition across 50+ languages, with real-time and batch processing.
Resemble AI
Voice Synthesis
AI voice generator providing real-time speech synthesis, voice cloning, and neural audio editing for creating custom synthetic voices.
Murf AI
Text-to-Speech
AI voice generator platform offering lifelike text-to-speech with over 120 voices in 20+ languages for videos, presentations, and e-learning.
Play.ht
Text-to-Speech
AI voice generation platform with ultra-realistic text-to-speech, voice cloning from short samples, and an API for building voice applications.
Otter.ai
Transcription
AI meeting assistant that provides real-time transcription, automated meeting notes, action items, and searchable conversation records.
Rev AI
Speech-to-Text
Speech-to-text API platform providing highly accurate transcription, real-time streaming, and topic extraction for developers and enterprises.
Need a Multimodal Solution?
Mixpeek processes video, image, audio, and text through unified pipelines. See how it compares to the tools listed in this directory.
