NEWWhy single embeddings fail for video.Read the post →
    Models/Visual Question Answering/Minqin/carets_vqa_finetuned

    carets_vqa_finetuned

    by Minqin

    Identifier
    Model ID
    Minqin/carets_vqa_finetuned

    Tags

    transformerspytorchviltvisual-question-answeringendpoints_compatibleregion:us

    Use carets_vqa_finetuned on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder