command-a-plus-05-2026-bf16
by CohereLabs
218B MoE multimodal model with native citation generation for document understanding
CohereLabs/command-a-plus-05-2026-bf16mixpeek://image_extractor@v1/cohere_command_a_plus_v1Overview
Command A+ is Cohere's flagship open-weight multimodal model with 218B total parameters and 25B active per token. It processes scanned documents, charts, and technical manuals while generating structured output with native citations back to source material.
The model handles 128K context across 48 languages, making it particularly valuable for enterprise document understanding where provenance and citations are critical. The w4a4 quantized variant runs on a single B200 or two H100s.
Architecture
Mixture-of-Experts transformer with 218B total parameters, 25B active per token. Multimodal encoder handles interleaved text and image inputs. Native citation and grounding head produces source references alongside generated text. 128K context window.
Mixpeek SDK Integration
from mixpeek import Mixpeekmx = Mixpeek(api_key="YOUR_KEY")mx.ingest(collection_id="legal-docs",source="s3://contracts/",extractors=[{"type": "scene_caption","model": "CohereLabs/command-a-plus-05-2026-bf16","output_feature": "caption"},{"type": "text_embedding","model": "BAAI/bge-m3","input_field": "caption","output_feature": "caption_embedding"}])
Capabilities
- Native citation generation with source grounding
- 128K context, 48 languages
- Document understanding (scans, charts, manuals)
- MoE efficiency: 25B active / 218B total
- Apache 2.0 license
Use Cases on Mixpeek
Benchmarks
| Dataset | Metric | Score | Source |
|---|---|---|---|
| MathVista | Accuracy | 80.6% | Cohere, May 2026 |
| CharXiv Reasoning | Accuracy | 52.7% | Cohere, May 2026 |
Performance
Specification
Research Paper
Introducing Command A+
arxiv.orgBuild a pipeline with command-a-plus-05-2026-bf16
Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.
Open Studio