The use of AI to translate text from one natural language to another while preserving meaning. Machine translation enables multilingual content processing and cross-lingual search in global multimodal systems.
Neural machine translation uses encoder-decoder transformer models to translate text. The encoder processes the source language sentence into contextualized representations, and the decoder generates the target language sentence token by token. Attention mechanisms align source and target tokens. Modern systems handle over 100 languages and produce near-human quality for well-resourced language pairs.
State-of-the-art systems include NLLB (No Language Left Behind, 200 languages), mBART, and M2M-100. Commercial APIs (Google Translate, DeepL) use proprietary large-scale models. Multilingual models share parameters across languages, enabling zero-shot translation between unseen language pairs. Quality is measured using BLEU, chrF, and COMET scores, with human evaluation for production systems.
Connect a bucket and Mixpeek runs the whole multimodal search pipeline for you: extraction, indexing, and search over your own objects. No models to wire up, nothing to host.
Start with ManagedKeep your embeddings on your own cloud and run dense, sparse, and BM25 search directly on object storage. First 1M vectors free.
Start with MVS