NEWWhy single embeddings fail for video.Read the post →
    Models/Visual Question Answering/mthsmtt/granite-vision-kvasir-vqa

    granite-vision-kvasir-vqa

    by mthsmtt

    Identifier
    Model ID
    mthsmtt/granite-vision-kvasir-vqa

    Tags

    peftsafetensorsllava_nextimage-text-to-textbase_model:adapter:ibm-granite/granite-vision-3.2-2bloratransformersmedicalvqavisual-question-answeringbase_model:ibm-granite/granite-vision-3.2-2btext-generation-inferenceendpoints_compatibleregion:us

    Use granite-vision-kvasir-vqa on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder