NEWAgents can now see video via MCP.Try it now →
    Models/Visual Question Answering/dandelin/vilt-b32-finetuned-vqa
    Visual Question Answeringtransformersapache-2.0

    vilt-b32-finetuned-vqa

    by dandelin

    177Kdl/month
    421likes
    Identifier
    Model ID
    dandelin/vilt-b32-finetuned-vqa

    Tags

    transformerspytorchviltvisual-question-answeringarxiv:2102.03334license:apache-2.0endpoints_compatibleregion:us

    Use vilt-b32-finetuned-vqa on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder