    Visual Question Answering, transformers, apache-2.0

    Aquila-VL-2B-llava-qwen

    by BAAI

    Identifier
    Model ID
    BAAI/Aquila-VL-2B-llava-qwen

    Tags

    transformers, safetensors, qwen2, text-generation, multimodal, visual-question-answering, en, zh, dataset:BAAI/Infinity-MM, dataset:BAAI/Infinity-Instruct, dataset:BAAI/Infinity-Preference, arxiv:2410.18558, base_model:Qwen/Qwen2.5-1.5B-Instruct, base_model:finetune:Qwen/Qwen2.5-1.5B-Instruct, license:apache-2.0, text-generation-inference, endpoints_compatible, region:us

    Use Aquila-VL-2B-llava-qwen on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
