NEWAgents can now see video via MCP.Try it now →
    Models/Visual Question Answering/microsoft/git-large-vqav2

    git-large-vqav2

    by microsoft

    141dl/month
    19likes
    Identifier
    Model ID
    microsoft/git-large-vqav2

    Tags

    transformerspytorchsafetensorsgitimage-text-to-textvisionvisual-question-answeringenarxiv:2205.14100license:mitendpoints_compatibleregion:us

    Use git-large-vqav2 on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder