NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/ByteDance-Seed/UI-TARS-1.5-7B
    Image Text To Texttransformersapache-2.0

    UI-TARS-1.5-7B

    by ByteDance-Seed

    41Kdl/month
    538likes
    Identifier
    Model ID
    ByteDance-Seed/UI-TARS-1.5-7B

    Tags

    transformerssafetensorsqwen2_5_vlimage-text-to-textmultimodalguiconversationalenarxiv:2501.12326arxiv:2404.07972arxiv:2409.08264arxiv:2401.13919arxiv:2504.01382arxiv:2405.14573arxiv:2410.23218arxiv:2504.07981license:apache-2.0eval-resultstext-generation-inferenceendpoints_compatibledeploy:azureregion:us

    Use UI-TARS-1.5-7B on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder