Models trained on large-scale multimodal datasets (e.g., CLIP, Flamingo, Gemini) used for feature extraction, search, and analysis.
Pretrained models are trained on extensive datasets to learn general features and patterns. They can be fine-tuned for specific tasks, providing a foundation for feature extraction, search, and analysis in multimodal systems.
Pretrained models use architectures like transformers and convolutional neural networks to learn from diverse data. They can be adapted to new tasks through fine-tuning, transfer learning, or multimodal extensions.
Connect a bucket and Mixpeek runs the whole multimodal search pipeline for you: extraction, indexing, and search over your own objects. No models to wire up, nothing to host.
Start with ManagedKeep your embeddings on your own cloud and run dense, sparse, and BM25 search directly on object storage. First 1M vectors free.
Start with MVS