    Visual Question Answering, transformers, apache-2.0

    ft-vilt-b32-mlm

    by aisuko

    Identifier
    Model ID
    aisuko/ft-vilt-b32-mlm

    Tags

    transformers, safetensors, vilt, visual-question-answering, generated_from_trainer, dataset:vqa, base_model:dandelin/vilt-b32-mlm, base_model:finetune:dandelin/vilt-b32-mlm, license:apache-2.0, endpoints_compatible, region:us

    Use ft-vilt-b32-mlm on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
