    Image To Text, transformers, mit

    MinerU-Diffusion-V1-0320-2.5B

    by opendatalab

    Identifier
    Model ID
    opendatalab/MinerU-Diffusion-V1-0320-2.5B

    Tags

    transformers, safetensors, mineru_diffusion, feature-extraction, ocr, document-understanding, vision-language-model, multimodal, trust-remote-code, mineru, image-to-text, custom_code, arxiv:2603.22458, arxiv:2406.07524, arxiv:2410.17891, arxiv:2503.09573, arxiv:2509.22186, arxiv:2409.18839, arxiv:2407.13773, license:mit, region:us

    Use MinerU-Diffusion-V1-0320-2.5B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
