333dl/month
13likes
Identifier
Model ID
tifa-benchmark/promptcap-coco-vqaTags
transformerspytorchofaimage-to-textvisual-question-answeringimage-captioningendataset:cocodataset:textvqadataset:VQAv2dataset:OK-VQAdataset:A-OKVQAarxiv:2211.09699license:openrailregion:us
Use promptcap-coco-vqa on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationtifa-benchmark
TaskImage To Text
Librarytransformers
Licenseopenrail
Downloads/mo333
Likes13
View on HuggingFace
See model card, files, and community discussion