    Visual Question Answering, transformers, apache-2.0

    vilt_finetuned_200

    by tpedelose

    Identifier
    Model ID
    tpedelose/vilt_finetuned_200

    Tags

    transformers, pytorch, vilt, visual-question-answering, generated_from_trainer, base_model:dandelin/vilt-b32-mlm, base_model:finetune:dandelin/vilt-b32-mlm, license:apache-2.0, endpoints_compatible, region:us

    Use vilt_finetuned_200 on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
