AI Tools Directory
Discover and compare the best AI tools, platforms, and services across every category.
103 tools across 10 categories
Browse by Category
Explore AI tools organized by function and capability to find the right solution for your needs.
Video AI Tools
Tools for video analysis, search, and processing
Image AI Tools
Tools for image recognition, search, and generation
Audio AI Tools
Tools for speech, audio processing, and transcription
Document AI Tools
Tools for document processing and extraction
Multimodal AI Platforms
Platforms that handle multiple data types
Vector Databases
Databases optimized for vector similarity search
Embedding Models
Models for generating vector embeddings
RAG Frameworks
Frameworks for retrieval-augmented generation
Content Moderation
Tools for automated content moderation
AI Search Engines
AI-powered search and retrieval engines
Featured Tools
Leading AI platforms and tools that are shaping the multimodal AI landscape.
Twelve Labs
Video AI Tools
Video understanding platform that enables developers to build programs that can see, hear, and understand video content using multimodal AI.
Mixpeek
Multimodal AI Platforms
Multimodal data infrastructure platform that indexes, processes, and retrieves across video, image, audio, and text with unified pipelines and search.
OpenAI
Multimodal AI Platforms
AI research and deployment company behind GPT-4, DALL-E, and Whisper, providing multimodal AI models through APIs and ChatGPT.
Pinecone
Vector Databases
Purpose-built vector database for machine learning applications, offering fully managed infrastructure for similarity search at scale.
LangChain
RAG Frameworks
Framework for developing applications powered by language models, providing tools for chains, agents, retrieval, and memory management.
Elasticsearch
AI Search Engines
Distributed search and analytics engine providing full-text search, vector search, and hybrid retrieval for applications at enterprise scale.
All Tools
Browse all 103 tools in the directory.
Twelve Labs
Video Understanding
Video understanding platform that enables developers to build programs that can see, hear, and understand video content using multimodal AI.
Runway
Video Generation
Applied AI research company building creative tools powered by machine learning for video generation, editing, and multimodal content creation.
Synthesia
Video Generation
AI video generation platform that creates professional videos from text using digital avatars and voiceovers in over 120 languages.
Descript
Video Editing
All-in-one video and audio editing platform that uses AI for transcription-based editing, screen recording, and podcasting.
Mux
Video Infrastructure
Video infrastructure platform providing APIs for video streaming, analytics, and real-time video processing at scale.
Pika
Video Generation
AI-powered video generation platform that creates and edits cinematic-quality videos from text and image prompts with expressive motion control.
Luma AI
Video Generation
AI company building multimodal models for video generation and 3D capture, known for Dream Machine text-to-video and photorealistic 3D reconstruction.
HeyGen
Video Generation
AI video creation platform for generating professional talking avatar videos at scale, used for marketing, training, and personalized outreach.
Kapwing
Video Editing
Online collaborative video editor with AI-powered tools for auto-captioning, background removal, and repurposing long-form content into clips.
Pictory
Video Creation
AI video creation platform that automatically converts long-form text and blog content into short branded videos with stock footage and captions.
Clarifai
Image Recognition
Full-lifecycle AI platform specializing in computer vision, natural language processing, and audio recognition with pre-built and custom model support.
Stability AI
Image Generation
Open-source AI company behind Stable Diffusion, providing image generation, upscaling, and editing models accessible to developers.
Midjourney
Image Generation
AI-powered image generation tool known for producing high-quality artistic images from text prompts through a Discord-based interface.
Roboflow
Computer Vision
End-to-end computer vision platform for building, training, and deploying custom vision models with annotation, dataset management, and deployment tools.
Immersity AI
Image Processing
AI platform for converting 2D images into immersive 3D content, enabling depth estimation and spatial computing applications.
Leonardo AI
Image Generation
AI-powered creative platform for generating production-quality images and assets with fine-tuned models, widely used in game development and design.
Adobe Firefly
Image Generation
Adobe's generative AI model family trained on licensed content, powering text-to-image, generative fill, and text effects across Creative Cloud.
PhotoRoom
Image Editing
AI photo editing platform specializing in background removal and product photography, enabling e-commerce teams to create studio-quality images instantly.
Remove.bg
Image Processing
AI service that automatically removes image backgrounds in seconds with high accuracy, available as a web tool, API, and desktop app.
DeepAI
Image Processing
AI platform offering a suite of image generation, enhancement, and analysis APIs including style transfer, colorization, and super resolution.
Deepgram
Speech-to-Text
AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.
AssemblyAI
Speech-to-Text
AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.
OpenAI Whisper
Speech-to-Text
Open-source automatic speech recognition system by OpenAI trained on 680K hours of multilingual data, supporting transcription and translation.
ElevenLabs
Text-to-Speech
AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.
