AI Tools Directory

    Discover and compare the best AI tools, platforms, and services across every category.

    103 tools across 10 categories

    Featured Tools

    Leading AI platforms and tools that are shaping the multimodal AI landscape.

    FEATURED

    Twelve Labs

    Video AI Tools

    Video understanding platform that enables developers to build programs that can see, hear, and understand video content using multimodal AI.

    freemium
    video
    audio
    text
    FEATURED

    Mixpeek

    Multimodal AI Platforms

    Multimodal data infrastructure platform that indexes, processes, and retrieves across video, image, audio, and text with unified pipelines and search.

    freemium
    video
    image
    audio
    text
    FEATURED

    OpenAI

    Multimodal AI Platforms

    AI research and deployment company behind GPT-4, DALL-E, and Whisper, providing multimodal AI models through APIs and ChatGPT.

    freemium
    text
    image
    audio
    video
    FEATURED

    Pinecone

    Vector Databases

    Purpose-built vector database for machine learning applications, offering fully managed infrastructure for similarity search at scale.

    freemium
    text
    image
    FEATURED

    LangChain

    RAG Frameworks

    Framework for developing applications powered by language models, providing tools for chains, agents, retrieval, and memory management.

    open-source
    open source
    text
    FEATURED

    Elasticsearch

    AI Search Engines

    Distributed search and analytics engine providing full-text search, vector search, and hybrid retrieval for applications at enterprise scale.

    freemium
    open source
    text

    All Tools

    Browse all 103 tools in the directory.

    Twelve Labs

    Video Understanding

    Video understanding platform that enables developers to build programs that can see, hear, and understand video content using multimodal AI.

    freemium
    video
    audio
    text

    Runway

    Video Generation

    Applied AI research company building creative tools powered by machine learning for video generation, editing, and multimodal content creation.

    freemium
    video
    image
    text

    Synthesia

    Video Generation

    AI video generation platform that creates professional videos from text using digital avatars and voiceover in over 120 languages.

    paid
    video
    text
    audio

    Descript

    Video Editing

    All-in-one video and audio editing platform that uses AI for transcription-based editing, screen recording, and podcasting.

    freemium
    video
    audio
    text

    Mux

    Video Infrastructure

    Video infrastructure platform providing APIs for video streaming, analytics, and real-time video processing at scale.

    freemium
    video

    Pika

    Video Generation

    AI-powered video generation platform that creates and edits cinematic-quality videos from text and image prompts with expressive motion control.

    freemium
    video
    image
    text

    Luma AI

    Video Generation

    AI company building multimodal models for video generation and 3D capture, known for Dream Machine text-to-video and photorealistic 3D reconstruction.

    freemium
    video
    image
    text

    HeyGen

    Video Generation

    AI video creation platform for generating professional talking avatar videos at scale, used for marketing, training, and personalized outreach.

    freemium
    video
    audio
    text

    Kapwing

    Video Editing

    Online collaborative video editor with AI-powered tools for auto-captioning, background removal, and repurposing long-form content into clips.

    freemium
    video
    audio
    text

    Pictory

    Video Creation

    AI video creation platform that automatically converts long-form text and blog content into short branded videos with stock footage and captions.

    paid
    video
    text

    Clarifai

    Image Recognition

    Full lifecycle AI platform specializing in computer vision, natural language processing, and audio recognition with pre-built and custom model support.

    freemium
    image
    video
    text

    Stability AI

    Image Generation

    Open-source AI company behind Stable Diffusion, providing image generation, upscaling, and editing models accessible to developers.

    freemium
    image
    text

    Midjourney

    Image Generation

    AI-powered image generation tool known for producing high-quality artistic images from text prompts through a Discord-based interface.

    paid
    image
    text

    Roboflow

    Computer Vision

    End-to-end computer vision platform for building, training, and deploying custom vision models with annotation, dataset management, and deployment tools.

    freemium
    image
    video

    Immersity AI

    Image Processing

    AI platform for converting 2D images into immersive 3D content, enabling depth estimation and spatial computing applications.

    freemium
    image

    Leonardo AI

    Image Generation

    AI-powered creative platform for generating production-quality images and assets with fine-tuned models, widely used in game development and design.

    freemium
    image
    text

    Adobe Firefly

    Image Generation

    Adobe's family of generative AI models trained on licensed content, powering text-to-image, generative fill, and text effects across Creative Cloud.

    freemium
    image
    text

    PhotoRoom

    Image Editing

    AI photo editing platform specializing in background removal and product photography, enabling e-commerce teams to create studio-quality images instantly.

    freemium
    image

    Remove.bg

    Image Processing

    AI service that automatically removes image backgrounds in seconds with high accuracy, available as a web tool, API, and desktop app.

    freemium
    image

    DeepAI

    Image Processing

    AI platform offering a suite of image generation, enhancement, and analysis APIs, including style transfer, colorization, and super-resolution.

    freemium
    image
    text

    Deepgram

    Speech-to-Text

    AI speech platform providing fast and accurate speech-to-text, text-to-speech, and audio intelligence APIs for developers.

    freemium
    audio
    text

    AssemblyAI

    Speech-to-Text

    AI platform for transcription, summarization, and audio intelligence with state-of-the-art speech recognition models.

    freemium
    audio
    text

    OpenAI Whisper

    Speech-to-Text

    Open-source automatic speech recognition system by OpenAI trained on 680K hours of multilingual data, supporting transcription and translation.

    open-source
    audio
    text

    ElevenLabs

    Text-to-Speech

    AI voice technology company offering realistic text-to-speech, voice cloning, and audio content creation in multiple languages.

    freemium
    audio
    text
