MinerU-Diffusion-V1-0320-2.5B

Name: MinerU-Diffusion-V1-0320-2.5B
Author: opendatalab

4K downloads/month

22 likes

Identifier

Model ID

opendatalab/MinerU-Diffusion-V1-0320-2.5B

Use MinerU-Diffusion-V1-0320-2.5B on Mixpeek

Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

Specification

Organization: opendatalab

Task: Image To Text

Library: transformers

License: MIT

Downloads/mo: 4K

Likes: 22

View on HuggingFace

See model card, files, and community discussion

Related Image To Text Models

zai-org/GLM-OCR (7.9M)

Salesforce/blip-image-captioning-base (2.1M)

Salesforce/blip-image-captioning-large (1.3M)

microsoft/trocr-base-printed (734K)

breezedeus/pix2text-mfr (633K)

Salesforce/blip2-opt-2.7b-coco (605K)

PaddlePaddle/PP-OCRv5_server_det (583K)

PaddlePaddle/UVDoc (404K)
