    Image Text To Text, transformers, apache-2.0

    Qwen3-VL-2B-Instruct

    by Qwen

    149.0M dl/month
    405 likes
    Identifier
    Model ID
    Qwen/Qwen3-VL-2B-Instruct

    Tags

    transformers, safetensors, qwen3_vl, image-text-to-text, conversational, arxiv:2505.09388, arxiv:2502.13923, arxiv:2409.12191, arxiv:2308.12966, license:apache-2.0, eval-results, endpoints_compatible, deploy:azure, region:us

    Use Qwen3-VL-2B-Instruct on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
