23 downloads/month
4 likes
Identifier
Model ID
ivelin/donut-refexp-combined-v1
Tags
transformers, pytorch, vision-encoder-decoder, image-text-to-text, ui refexp, visual-question-answering, en, dataset:ivelin/rico_refexp_combined, license:agpl-3.0, endpoints_compatible, region:us
Use donut-refexp-combined-v1 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Specification
Organization: ivelin
Task: Visual Question Answering
Library: transformers
License: agpl-3.0
Downloads/month: 23
Likes: 4
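
Since the listing names transformers as the library and tags the model as vision-encoder-decoder, loading it might look like the minimal sketch below. It assumes the checkpoint ships a Donut-style processor (suggested by the model name, not confirmed by this page) and that `transformers` and `torch` are installed. The helper name `load_donut_refexp` is hypothetical. The prompt format the model expects is not stated here and should be taken from the model card.

```python
# Model ID copied from the listing.
MODEL_ID = "ivelin/donut-refexp-combined-v1"


def load_donut_refexp(model_id: str = MODEL_ID):
    """Return (processor, model) for the given Hub model ID.

    Assumption: the checkpoint is loadable with DonutProcessor and
    VisionEncoderDecoderModel, as the name and tags suggest.
    """
    # Imported lazily so this file can be imported without the heavy deps.
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()  # inference mode: disables dropout
    return processor, model
```

Calling `load_donut_refexp()` downloads the weights on first use. Note that the agpl-3.0 license applies to the model.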
View on HuggingFace
See model card, files, and community discussion