    temp_vilt_vqa

    by Bingsu

    Identifier
    Model ID: Bingsu/temp_vilt_vqa

    Tags

    transformers, pytorch, vilt, visual-question-answering, endpoints_compatible, region:us

    Use temp_vilt_vqa on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
