NEWWhy single embeddings fail for video.Read the post →
    Models/Visual Question Answering/MohamedTahir/ViLTVQA
    Visual Question Answeringtransformersapache-2.0

    ViLTVQA

    by MohamedTahir

    Identifier
    Model ID
    MohamedTahir/ViLTVQA

    Tags

    transformerssafetensorsviltvisual-question-answeringlicense:apache-2.0endpoints_compatibleregion:us

    Use ViLTVQA on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder