NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/meta-llama/Llama-3.2-11B-Vision-Instruct
    Image Text To Texttransformersllama3.2

    Llama-3.2-11B-Vision-Instruct

    by meta-llama

    135Kdl/month
    1,587likes
    Identifier
    Model ID
    meta-llama/Llama-3.2-11B-Vision-Instruct

    Tags

    transformerssafetensorsmllamaimage-text-to-textfacebookmetapytorchllamallama-3conversationalendefritpthiestharxiv:2204.05149license:llama3.2text-generation-inferenceendpoints_compatibleregion:us

    Use Llama-3.2-11B-Vision-Instruct on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder