NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/xlangai/OpenCUA-7B
    Image Text To Texttransformersmit

    OpenCUA-7B

    by xlangai

    135Kdl/month
    29likes
    Identifier
    Model ID
    xlangai/OpenCUA-7B

    Tags

    transformerssafetensorsopencuafeature-extractionVLMComputer-Use-AgentOS-AgentGUIGroundingimage-text-to-textconversationalcustom_codeendataset:xlangai/AgentNetdataset:xlangai/aguvis-stage1dataset:smolagents/aguvis-stage-2dataset:osunlp/UGround-V1-Dataarxiv:2508.09123arxiv:2504.07981base_model:Qwen/Qwen2.5-VL-7B-Instructbase_model:finetune:Qwen/Qwen2.5-VL-7B-Instructlicense:mitregion:us

    Use OpenCUA-7B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder