1Kdl/month
21likes
Identifier
Model ID
microsoft/git-base-vqav2Tags
transformerspytorchsafetensorsgitimage-text-to-textvisionvisual-question-answeringenarxiv:2205.14100license:mitregion:us
Use git-base-vqav2 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
Organizationmicrosoft
TaskVisual Question Answering
Librarytransformers
Licensemit
Downloads/mo1K
Likes21
View on HuggingFace
See model card, files, and community discussion