    Multimodal Retrieval: Search Text, Images, and Video by Meaning

    0:60
    Short Form
    Ethan
    December 23, 2025

    About this video

    Search shouldn’t be limited to text. Multimodal retrieval lets you find relevant results across **text, images, and video** using shared meaning. In this video, I show how the same system can:

    * Take a text query and find matching images
    * Use an image to retrieve similar images or video segments
    * Find related video moments based on visual and semantic similarity

    This is the foundation behind modern AI search, recommendation systems, and content understanding across modalities.
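
To make "shared meaning" concrete, here is a minimal sketch of cross-modal search, not the system shown in the video. It assumes every item (caption, image, video segment) has already been mapped into one shared vector space by some multimodal encoder. The vectors, item names, and the `Item`, `cosine`, and `search` helpers below are all hypothetical and hand-made for illustration; a real system would get its vectors from an embedding model and would likely use a vector index rather than a linear scan.

```python
import math
from dataclasses import dataclass


@dataclass
class Item:
    item_id: str
    modality: str        # "text", "image", or "video_segment"
    vector: list         # embedding in the shared space (invented values here)


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vector, index, top_k=2, modalities=None):
    """Rank items by similarity to the query, optionally filtered by modality."""
    candidates = [it for it in index
                  if modalities is None or it.modality in modalities]
    ranked = sorted(candidates,
                    key=lambda it: cosine(query_vector, it.vector),
                    reverse=True)
    return [(it.item_id, it.modality) for it in ranked[:top_k]]


# Toy index: one list holds all modalities because they share one space.
INDEX = [
    Item("caption_dog", "text", [0.9, 0.1, 0.0]),
    Item("photo_dog", "image", [0.8, 0.2, 0.1]),
    Item("photo_city", "image", [0.1, 0.9, 0.2]),
    Item("clip_dog_0-5s", "video_segment", [0.7, 0.1, 0.3]),
    Item("clip_city_5-10s", "video_segment", [0.0, 0.8, 0.4]),
]

# Text -> image: this vector stands in for an embedded text query like "a dog".
text_query = [1.0, 0.0, 0.0]
text_to_image = search(text_query, INDEX, top_k=1, modalities={"image"})

# Image -> video: reuse an image's own vector as the query.
image_query = INDEX[1].vector
image_to_video = search(image_query, INDEX, top_k=1,
                        modalities={"video_segment"})

print(text_to_image)
print(image_to_video)
```

The point of the sketch is that the search function never checks what kind of input produced the query vector. Once text, images, and video segments live in the same space, "text to image" and "image to video" are the same nearest-neighbor lookup with a different modality filter.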