    data pipeline

    Databricks Connector

    Lakehouse AI integration for multimodal data

    Bring Mixpeek's multimodal processing into the Databricks Lakehouse with native notebook support and Delta Lake integration. Enrich tables at scale, generate embeddings for Unity Catalog assets, and build feature tables for downstream ML models.

    databricks
    lakehouse
    delta lake
    MLflow
    feature store
    unity catalog
    Quick Install:
    npm install @mixpeek/databricks

    Use Cases

    1. Building multimodal feature stores on Delta Lake
    2. Enriching Unity Catalog datasets with AI signals
    3. Training ML models on Mixpeek-generated embeddings
    4. Batch classification of unstructured lakehouse data

    Features

    Delta Lake reader and writer for enriched data
    MLflow integration for experiment tracking
    Unity Catalog-compatible enrichment functions
    Notebook-friendly API with display helpers
    Cluster-aware parallel processing
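
    The parallel batch enrichment described above can be illustrated with a minimal, generic sketch. It does not use the connector's API, which is not documented on this page: the `Row` and `EnrichedRow` shapes, the `EnrichFn` callback, and the `enrichRows` helper are all hypothetical names introduced for illustration. The sketch only shows the general pattern of enriching rows with a bounded number of calls in flight while preserving row order.

```typescript
// Hypothetical row shape: an id plus a reference to unstructured content.
interface Row {
  id: string;
  contentUrl: string;
}

// Hypothetical enriched row: the original row plus an embedding vector.
interface EnrichedRow extends Row {
  embedding: number[];
}

// Stand-in for a real enrichment call. This is an assumption, not the
// connector's actual API.
type EnrichFn = (row: Row) => Promise<number[]>;

// Enrich rows with at most `concurrency` calls in flight, preserving input order.
async function enrichRows(
  rows: Row[],
  enrich: EnrichFn,
  concurrency = 4,
): Promise<EnrichedRow[]> {
  const results: EnrichedRow[] = new Array(rows.length);
  let next = 0; // index of the next row to claim

  // Each worker claims the next unprocessed index until none remain.
  async function worker(): Promise<void> {
    while (next < rows.length) {
      const i = next++;
      const row = rows[i];
      results[i] = { ...row, embedding: await enrich(row) };
    }
  }

  const workerCount = Math.max(1, Math.min(concurrency, rows.length));
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results;
}
```

    In practice, the `enrich` callback would wrap whatever enrichment call the connector exposes, and the returned rows would be written back to a Delta table. The concurrency limit is one simple way to avoid overwhelming an upstream service; it is not a description of how the connector schedules work on a cluster.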

    Integrations

    Databricks
    Delta Lake
    MLflow
    Unity Catalog

    Details

    License: Apache 2.0
    Category: data pipeline
    Registry: npm

    Ready to integrate?

    Get started with the Databricks Connector in minutes. Check out the documentation or explore the source code on GitHub.