83dl/month
20likes
Identifier
Model ID
google/pix2struct-widget-captioning-largeTags
transformerspytorchsafetensorspix2structimage-text-to-textvisual-question-answeringenfrrodemultilingualarxiv:2210.03347license:apache-2.0region:us
Use pix2struct-widget-captioning-large on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationgoogle
TaskVisual Question Answering
Librarytransformers
Licenseapache-2.0
Downloads/mo83
Likes20
View on HuggingFace
See model card, files, and community discussion