NEWAgents can now see video via MCP.Try it now →
    Models/Image To Text/tifa-benchmark/promptcap-coco-vqa
    Image To Texttransformersopenrail

    promptcap-coco-vqa

    by tifa-benchmark

    333dl/month
    13likes
    Identifier
    Model ID
    tifa-benchmark/promptcap-coco-vqa

    Tags

    transformerspytorchofaimage-to-textvisual-question-answeringimage-captioningendataset:cocodataset:textvqadataset:VQAv2dataset:OK-VQAdataset:A-OKVQAarxiv:2211.09699license:openrailregion:us

    Use promptcap-coco-vqa on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder