NEWVectors or files. Pick a path.Start →
    Models/Visual Question Answering/Joe99/visionlanguageTransformer
    Visual Question Answeringtransformersapache-2.0

    visionlanguageTransformer

    by Joe99

    Identifier
    Model ID
    Joe99/visionlanguageTransformer

    Tags

    transformerspytorchviltvisual-question-answeringenarxiv:2102.03334license:apache-2.0endpoints_compatibleregion:us

    Use visionlanguageTransformer on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.

    Open Studio