Mixpeek Logo

    Multimodal AI Platforms

    Platforms that handle multiple data types

    6 tools listed

    Back to Directory

    Subcategories:

    Foundation Models (2)ML Platform (2)Multimodal Infrastructure (1)Visual Data (1)

    Showing 6 of 6 tools

    Mixpeek logo

    Mixpeek

    Multimodal Infrastructure

    Multimodal data infrastructure platform that indexes, processes, and retrieves across video, image, audio, and text with unified pipelines and search.

    freemium
    video
    image
    audio
    text

    Key features:

    Multimodal indexingFeature extractionUnified search+2 more
    OpenAI logo

    OpenAI

    Foundation Models

    AI research and deployment company behind GPT-4, DALL-E, and Whisper, providing multimodal AI models through APIs and ChatGPT.

    freemium
    text
    image
    audio
    video

    Key features:

    GPT-4 VisionDALL-E image generationWhisper transcription+2 more
    Google Vertex AI logo

    Google Vertex AI

    ML Platform

    Google Cloud ML platform providing access to Gemini models, AutoML, and custom training for building multimodal AI applications at scale.

    paid
    text
    image
    audio
    video

    Key features:

    Gemini modelsAutoMLCustom training+2 more
    Anthropic logo

    Anthropic

    Foundation Models

    AI safety company providing Claude, a multimodal AI assistant capable of analyzing text, images, and code with a focus on helpfulness and safety.

    freemium
    text
    image

    Key features:

    Claude modelsVision analysisLong context windows+2 more
    Coactive AI logo

    Coactive AI

    Visual Data

    Visual data platform that enables teams to search, analyze, and organize image and video content using multimodal AI understanding.

    enterprise
    image
    video

    Key features:

    Visual searchContent taggingBrand monitoring+2 more
    Amazon Bedrock logo

    Amazon Bedrock

    ML Platform

    Fully managed service from AWS providing access to foundation models from leading AI companies for building generative AI applications.

    paid
    text
    image

    Key features:

    Model selectionFine-tuningRAG support+2 more

    Need a Multimodal Solution?

    Mixpeek processes video, image, audio, and text through unified pipelines. See how it compares to the tools listed in this directory.