76 downloads/month
6 likes
Identifier
Model ID: microsoft/git-large-textvqa
Tags: transformers, pytorch, safetensors, git, image-text-to-text, vision, visual-question-answering, en, arxiv:2205.14100, license:mit, region:us
Use git-large-textvqa on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Specification
Organization: microsoft
Task: Visual Question Answering
Library: transformers
License: MIT
Downloads/mo: 76
Likes: 6
View on HuggingFace
See model card, files, and community discussion
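The listing gives a model ID, a task (Visual Question Answering), and a library (transformers), so a short usage sketch may help. The example below is not taken from this page: it assumes the checkpoint loads through `AutoProcessor` and `AutoModelForCausalLM`, as the GIT family generally does in transformers, and that `torch` and `Pillow` are installed. The names `build_prompt_ids` and `answer_question` are hypothetical helpers introduced only for illustration.

```python
from typing import List

# Model ID as shown in the listing above.
MODEL_ID = "microsoft/git-large-textvqa"


def build_prompt_ids(cls_token_id: int, question_ids: List[int]) -> List[int]:
    """Prefix the tokenized question with the CLS token id (hypothetical helper)."""
    return [cls_token_id] + list(question_ids)


def answer_question(image_path: str, question: str, max_length: int = 50) -> str:
    """Ask a question about an image (sketch; assumes the GIT-style loading path)."""
    # Heavy imports are kept local so this module imports without them installed.
    import torch
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    # Encode the image into pixel values.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Tokenize the question without special tokens, then prepend CLS manually.
    question_ids = processor(text=question, add_special_tokens=False).input_ids
    input_ids = torch.tensor(
        build_prompt_ids(processor.tokenizer.cls_token_id, question_ids)
    ).unsqueeze(0)

    # Generate a continuation conditioned on the image and the question.
    generated_ids = model.generate(
        pixel_values=pixel_values, input_ids=input_ids, max_length=max_length
    )
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling `answer_question("photo.jpg", "what does the sign say?")` (with a hypothetical local image path) downloads the weights on first use. The decoded string typically contains the question followed by the generated answer, so the question prefix may need to be stripped.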