Video Learning Hub
Master multimodal AI concepts through comprehensive tutorials, guides, and best practices from our expert team.

Building an Exploratory Multimodal Retriever with the National Gallery
Discover how to build a powerful exploratory image board using multimodal search across 120,000 images from the National Gallery. This walkthrough demonstrates combining text search, reverse image search, and document-based queries into a unified retrieval experience using hybrid search with Reciprocal Rank Fusion (RRF).
👉 Live Demo: https://mxp.co/r/npg
What you'll learn:
⚡ Building exploratory search interfaces for visual content
⚡ Combining text, image, and document reference queries
⚡ Implementing hybrid search with RRF for optimal results (sketched below)
⚡ Using Google SigLIP embeddings for image understanding
⚡ Creating multi-stage retriever pipelines with feature search
⚡ Capturing user signals for recommendation systems
⚡ Architecture patterns: Objects → Buckets → Collections → Retrievers
Real-world demo: Visual curation across 120,000 images (12GB of data) with text + image + document hybrid queries. Full source code is available in the Mixpeek showcase repository.
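To make the fusion step concrete, here is a minimal sketch of Reciprocal Rank Fusion in plain Python. It illustrates the general algorithm rather than Mixpeek's implementation; the document IDs are hypothetical, and k=60 is simply the constant commonly used in the RRF literature.

```python
# Reciprocal Rank Fusion (RRF): merge ranked result lists from
# independent retrievers (e.g. text search and reverse image search).
def rrf_fuse(ranked_lists: list[list[str]], k: int = 60) -> list[str]:
    scores: dict[str, float] = {}
    for results in ranked_lists:
        for rank, doc_id in enumerate(results, start=1):
            # Each list contributes 1 / (k + rank) per document it returns, so
            # items ranked highly by multiple retrievers accumulate the most score.
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

text_hits = ["img_042", "img_007", "img_913"]   # hypothetical text-query results
image_hits = ["img_913", "img_042", "img_128"]  # hypothetical reverse-image results
print(rrf_fuse([text_hits, image_hits]))        # img_042 and img_913 rise to the top
```

Because RRF uses only ranks, not raw scores, it can fuse retrievers whose score scales are incomparable (BM25 vs cosine similarity) without any normalization step.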

Web Scraper Guide
Learn how to use Mixpeek's Web Scraper to recursively crawl websites and extract multimodal content with automatic embeddings. This guide demonstrates crawling documentation sites, extracting code snippets and images, and making everything searchable with semantic embeddings.
What you'll learn:
⚡ Recursive website crawling with depth control (sketched below)
⚡ Extracting text, code blocks, and images
⚡ Multimodal embeddings (E5-Large, Jina Code, SigLIP)
⚡ JavaScript rendering for SPAs
⚡ URL filtering and structured extraction
⚡ Building searchable knowledge bases from docs
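As a rough illustration of depth-controlled recursive crawling, here is a sketch using requests and BeautifulSoup. This is not Mixpeek's Web Scraper: the start URL is a placeholder, and a production crawler would also need politeness delays, robots.txt handling, and JavaScript rendering for SPAs.

```python
# Depth-limited recursive crawl sketch (requests + BeautifulSoup).
import requests
from bs4 import BeautifulSoup
from urllib.parse import urljoin, urlparse

def crawl(url: str, max_depth: int, depth: int = 0, seen: set | None = None):
    seen = seen if seen is not None else set()
    if depth > max_depth or url in seen:
        return
    seen.add(url)
    html = requests.get(url, timeout=10).text
    soup = BeautifulSoup(html, "html.parser")
    # Count the multimodal content on this page: images and code blocks.
    print(f"{'  ' * depth}{url}: {len(soup.find_all('img'))} images, "
          f"{len(soup.find_all('pre'))} code blocks")
    host = urlparse(url).netloc
    for a in soup.find_all("a", href=True):
        link = urljoin(url, a["href"]).split("#")[0]  # resolve relative links, drop anchors
        if urlparse(link).netloc == host:             # URL filter: stay on the same site
            crawl(link, max_depth, depth + 1, seen)

crawl("https://docs.example.com", max_depth=2)        # placeholder start URL
```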

Buckets Guide
Learn how to use Mixpeek Buckets for schema-backed data ingestion with automatic validation and lineage tracking. This guide demonstrates creating buckets, defining schemas, uploading objects with multimodal blobs, and processing them through collections.
What you'll learn:
⚡ Creating buckets with JSON schema validation (sketched below)
⚡ Uploading objects with multimodal blobs (text, image, video, JSON)
⚡ Schema enforcement and blob type validation
⚡ Lineage tracking from source to documents
⚡ Integration with collections for feature extraction
⚡ Best practices for organizing multimodal data
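Here is a minimal sketch of the schema-validation idea using the jsonschema package. The schema shape and field names are hypothetical, not Mixpeek's actual bucket schema format; the point is that malformed objects are rejected at ingestion time, before they reach a collection.

```python
# Schema-backed ingestion sketch with the jsonschema package.
from jsonschema import ValidationError, validate

bucket_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "image_url": {"type": "string"},
        "tags": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["title", "image_url"],  # enforce required fields per object
}

obj = {"title": "Self-Portrait", "image_url": "https://example.com/a.jpg", "tags": ["oil"]}

try:
    validate(instance=obj, schema=bucket_schema)
    print("object accepted")
except ValidationError as err:
    print(f"object rejected: {err.message}")
```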

Video Understanding: From Frames to Contextual Search
Master video understanding and how it differs from basic image understanding. This video covers frame extraction techniques (sampling, keyframe detection, scene-based), video embedding models that capture temporal context, and building sophisticated semantic video search applications.
What you'll learn:
⚡ Video vs image understanding: temporal context matters
⚡ Frame extraction techniques: sampling, keyframe, scene-based (sampling sketched below)
⚡ Frame-level vs video-level embeddings
⚡ How video embeddings capture motion and actions
⚡ Scene detection with AutoShot and semantic deduplication
⚡ Vertex AI multimodal embeddings for video
⚡ Building scene-based video search pipelines
⚡ Real demo: Contextual video retrieval in Mixpeek Studio
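For a feel of the simplest strategy, here is a uniform frame-sampling sketch with OpenCV; "clip.mp4" is a placeholder path. Keyframe and scene-based extraction replace the fixed stride below with a content-aware trigger, such as a shot-boundary detector like AutoShot.

```python
# Uniform frame sampling with OpenCV: keep one frame per time window.
import cv2

def sample_frames(video_path: str, every_n_seconds: float = 1.0):
    cap = cv2.VideoCapture(video_path)
    fps = cap.get(cv2.CAP_PROP_FPS) or 30.0      # fall back if FPS is unreadable
    stride = max(1, int(fps * every_n_seconds))
    frames, idx = [], 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % stride == 0:                    # keep one frame per window
            frames.append(frame)
        idx += 1
    cap.release()
    return frames

frames = sample_frames("clip.mp4", every_n_seconds=2.0)
print(f"extracted {len(frames)} frames")
```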

Image Understanding: Vision Encoders & Multimodal Search
Master how computers see and search images. This video covers vision encoding models like CLIP and SigLIP, how images are converted into patches and embeddings, object detection with YOLO, and building multimodal search systems that support text-to-image, image-to-text, and image-to-image queries.
What you'll learn:
⚡ How vision transformers convert images into embeddings
⚡ Image patches and mean pooling explained
⚡ CLIP vs SigLIP embedding models
⚡ Object detection and classification with YOLO
⚡ Cross-modal search: text queries on image datasets (sketched below)
⚡ Combining text + image queries with mean pooling
⚡ Feature URIs for image extractors
⚡ Live demo: National Gallery multimodal retriever
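Here is a small cross-modal search sketch using an open-source CLIP checkpoint via sentence-transformers. It illustrates the concept rather than Mixpeek's extractors, and the image filenames are placeholders.

```python
# Text-to-image search with CLIP: text and images share one embedding space.
from PIL import Image
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("clip-ViT-B-32")

img_embs = model.encode([Image.open("painting1.jpg"), Image.open("painting2.jpg")])
query_emb = model.encode("a portrait of a woman in a red dress")

# Cosine similarity ranks images against the text query.
scores = util.cos_sim(query_emb, img_embs)[0]
best = int(scores.argmax())
print(f"best match: painting{best + 1}.jpg (score {scores[best].item():.3f})")

# A combined text + image query can be approximated by mean pooling the
# two embeddings into a single query vector in the same space.
combined = (query_emb + img_embs[0]) / 2
```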

Feature URIs: Evolving Embeddings Without Migration
Learn how to evolve embedding models without painful re-indexing. Master Feature URIs, a core abstraction for managing the lifecycle of embeddings, extractors, and indexes. Discover why vector indexes are stateful, how to A/B test embedding models safely, and how to roll forward and roll back upgrades without downtime.
What you'll learn:
⚡ Why vector indexes are inherently stateful and fragile
⚡ The 4 components of a Feature URI (sketched below)
⚡ How extractors, embedding models, versions, and inference endpoints are coupled
⚡ A/B testing embedding models without re-indexing
⚡ Rolling forward and rolling back embedding upgrades
⚡ Real examples using image collections and feature search
⚡ How Feature URIs enable hybrid search, re-ranking, and evaluation
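To illustrate the abstraction, here is a hypothetical sketch of a feature URI as a value object. The slash-separated format below is invented for illustration and is not Mixpeek's actual Feature URI syntax; the point is only that the four coupled components travel together as one identifier, pinning each index to the exact pipeline that produced it.

```python
# Hypothetical feature-URI value object: extractor + model + version + endpoint.
from dataclasses import dataclass

@dataclass(frozen=True)
class FeatureURI:
    extractor: str  # e.g. "image_extractor"
    model: str      # e.g. "siglip"
    version: str    # e.g. "v2"
    endpoint: str   # e.g. an inference endpoint name

    @classmethod
    def parse(cls, uri: str) -> "FeatureURI":
        return cls(*uri.split("/", 3))

old = FeatureURI.parse("image_extractor/clip/v1/default")
new = FeatureURI.parse("image_extractor/siglip/v2/default")

# Distinct URIs mean distinct, co-existing indexes: query both during an
# A/B test, then retire one to roll forward (or back) without re-indexing
# the surviving side.
assert old != new
```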

Hybrid Search: Best of Both Worlds
Combine keyword (BM25) and semantic (vector) search for maximum retrieval effectiveness. Learn when keyword search fails with synonyms and paraphrasing, when semantic search fails with specific IDs and acronyms, and how to use score fusion strategies to get the best of both worlds.
What you'll learn:
⚡ Trade-offs between keyword and semantic search
⚡ Precision vs recall in retrieval systems
⚡ Score fusion strategies: RRF, weighted, distribution-based (weighted fusion sketched below)
⚡ The 80/20 rule for catching edge cases
⚡ Building hybrid retrievers with feature_filter and attribute_filter
⚡ Real-world example: Developer documentation search
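Alongside the RRF sketch earlier on this page, here is a minimal weighted-fusion sketch: min-max normalize each retriever's scores onto [0, 1], then blend with a tunable alpha. The scores and document IDs are made up for illustration.

```python
# Weighted score fusion of BM25 and vector scores after min-max normalization.
def minmax(scores: dict) -> dict:
    lo, hi = min(scores.values()), max(scores.values())
    span = (hi - lo) or 1.0                          # avoid division by zero
    return {k: (v - lo) / span for k, v in scores.items()}

def weighted_fuse(bm25: dict, vector: dict, alpha: float = 0.5) -> list:
    b, v = minmax(bm25), minmax(vector)
    ids = set(b) | set(v)
    # A document missing from one retriever scores 0 on that side.
    return sorted(ids, reverse=True,
                  key=lambda d: alpha * b.get(d, 0.0) + (1 - alpha) * v.get(d, 0.0))

bm25_scores = {"doc_a": 12.4, "doc_b": 9.1, "doc_c": 3.2}      # keyword (BM25) hits
vector_scores = {"doc_b": 0.91, "doc_d": 0.88, "doc_a": 0.52}  # semantic hits
print(weighted_fuse(bm25_scores, vector_scores, alpha=0.4))    # alpha < 0.5 favors semantic
```

Normalization is what makes the blend meaningful: raw BM25 scores are unbounded while cosine similarities live in [-1, 1], so mixing them without rescaling would let one retriever dominate.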

Chunking Strategies: Breaking Documents into Searchable Pieces
Master the art of breaking large documents into searchable chunks. Learn why chunking is necessary for context windows and precision, explore fixed-size, semantic, and sentence-based strategies, and understand chunk overlap techniques that prevent information loss at boundaries.
What you'll learn:
⚡ Why chunking matters for context windows and precision
⚡ Chunking strategies: fixed-size, semantic, sentence-based, layout-based (fixed-size sketched below)
⚡ Chunk overlap as a safety net (67% → 94% accuracy improvement)
⚡ Multimodal chunking: videos, audio, images, PDFs
⚡ Building object decomposition pipelines in Mixpeek
⚡ Real-world example: 200-page legal contract analysis
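Here is a minimal fixed-size chunking sketch with overlap; the chunk and overlap sizes are illustrative, not recommendations. The overlap window keeps text that straddles a boundary retrievable from both neighboring chunks instead of being split in half.

```python
# Fixed-size chunking with a character-level overlap safety net.
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 100) -> list[str]:
    assert 0 <= overlap < chunk_size
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += chunk_size - overlap        # step back by `overlap` characters
    return chunks

doc = "The party of the first part shall indemnify... " * 200  # stand-in for a long contract
pieces = chunk_text(doc, chunk_size=500, overlap=100)
print(f"{len(pieces)} chunks, each sharing 100 characters with its neighbor")
```

Semantic and sentence-based strategies replace the fixed character stride with boundaries derived from meaning or punctuation, trading simplicity for cleaner chunk edges.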

Semantic Search Fundamentals
Move beyond keyword matching to meaning-based retrieval. This video covers the evolution from keyword search to semantic search, vector similarity algorithms (cosine similarity, dot product), encoding models, and building feature search pipelines in Mixpeek.
What you'll learn:
⚡ Keyword search vs semantic search
⚡ Cosine similarity and dot product explained (sketched below)
⚡ How encoder models capture meaning
⚡ HNSW indexes for vector search
⚡ Building retrieval pipelines with feature search
⚡ Intent-based retrieval in practice
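Here is a small NumPy sketch of the two similarity measures using toy 3-dimensional vectors (real embeddings have hundreds of dimensions). On unit-normalized vectors, cosine similarity and dot product give the same ranking, which is why many vector indexes store normalized embeddings and use the cheaper dot product internally.

```python
# Cosine similarity vs dot product on toy embedding vectors.
import numpy as np

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

query = np.array([0.2, 0.9, 0.1])
docs = np.array([[0.1, 0.8, 0.3],   # close in meaning to the query
                 [0.9, 0.1, 0.0]])  # different meaning

print([round(cosine(query, d), 3) for d in docs])  # first doc scores higher

# After unit-normalizing, a plain dot product reproduces the same ranking.
qn = query / np.linalg.norm(query)
dn = docs / np.linalg.norm(docs, axis=1, keepdims=True)
print(dn @ qn)
```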
