3K downloads/month
44 likes
Identifier
Model ID: google/pix2struct-docvqa-base
Tags: transformers, pytorch, safetensors, pix2struct, image-text-to-text, visual-question-answering, en, fr, ro, de, multilingual, arxiv:2210.03347, license:apache-2.0, region:us
Use pix2struct-docvqa-base on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Specification
Organization: google
Task: Visual Question Answering
Library: transformers
License: apache-2.0
Downloads/mo: 3K
Likes: 44
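
Since the listing names transformers as the library and visual question answering as the task, a minimal sketch of local inference may help. It assumes the `transformers`, `torch`, and `Pillow` packages are installed and that the checkpoint can be downloaded; the helper name `answer_question` and its `max_new_tokens` default are illustrative choices, not part of the listing.

```python
MODEL_ID = "google/pix2struct-docvqa-base"  # model ID as given in the listing


def answer_question(image, question, max_new_tokens=32):
    """Return the model's answer to `question` about a document image.

    `image` is expected to be a PIL.Image; this helper is a hypothetical
    convenience wrapper, not an official API.
    """
    # Imported lazily so that defining the helper does not require the
    # heavy dependencies to be installed.
    from transformers import (
        Pix2StructForConditionalGeneration,
        Pix2StructProcessor,
    )

    processor = Pix2StructProcessor.from_pretrained(MODEL_ID)
    model = Pix2StructForConditionalGeneration.from_pretrained(MODEL_ID)

    # The processor takes the image and the question together and returns
    # PyTorch tensors ready for generation.
    inputs = processor(images=image, text=question, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Decode the first generated sequence into plain text.
    return processor.decode(output_ids[0], skip_special_tokens=True)
```

For example, `answer_question(Image.open("invoice.png"), "What is the invoice total?")` would return the decoded answer string, where `invoice.png` is a placeholder file name. The sketch reloads the model on every call for brevity; a real pipeline would load it once and reuse it.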