4Kdl/month
22likes
Identifier
Model ID
opendatalab/MinerU-Diffusion-V1-0320-2.5BTags
transformerssafetensorsmineru_diffusionfeature-extractionocrdocument-understandingvision-language-modelmultimodaltrust-remote-codemineruimage-to-textcustom_codearxiv:2603.22458arxiv:2406.07524arxiv:2410.17891arxiv:2503.09573arxiv:2509.22186arxiv:2409.18839arxiv:2407.13773license:mitregion:us
Use MinerU-Diffusion-V1-0320-2.5B on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationopendatalab
TaskImage To Text
Librarytransformers
Licensemit
Downloads/mo4K
Likes22
View on HuggingFace
See model card, files, and community discussion