14Kdl/month
275likes
Identifier
Model ID
naver-clova-ix/donut-base-finetuned-docvqaTags
transformerspytorchvision-encoder-decoderimage-text-to-textdonutimage-to-textvisiondocument-question-answeringarxiv:2111.15664license:mitendpoints_compatibleregion:us
Use donut-base-finetuned-docvqa on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationnaver-clova-ix
TaskDocument Question Answering
Librarytransformers
Licensemit
Downloads/mo14K
Likes275
View on HuggingFace
See model card, files, and community discussion
Related Document Question Answering Models
impira/layoutlm-document-qa
85K
fxmarty/tiny-doc-qa-vision-encoder-decoder
3K
impira/layoutlm-invoices
1K
tiennvcs/layoutlmv2-base-uncased-finetuned-docvqa
603
zpm/Llama-3.1-PersianQA
367
Xenova/donut-base-finetuned-docvqa
204
AntonioTH/Layout-finetuned-fr-model-50instances20-100epochs-5e-05lr
188
xhyi/layoutlmv3_docvqa_t11c5000
167