NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/Qwen/Qwen2.5-VL-3B-Instruct
    Image Text To Texttransformers

    Qwen2.5-VL-3B-Instruct

    by Qwen

    4.3Mdl/month
    640likes
    Identifier
    Model ID
    Qwen/Qwen2.5-VL-3B-Instruct

    Tags

    transformerssafetensorsqwen2_5_vlimage-text-to-textmultimodalconversationalenarxiv:2309.00071arxiv:2409.12191arxiv:2308.12966eval-resultstext-generation-inferenceendpoints_compatibledeploy:azureregion:us

    Use Qwen2.5-VL-3B-Instruct on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder