1K downloads/month
21 likes
Identifier
Model ID: robotics-diffusion-transformer/RDT2-VQ
Tags: transformers, safetensors, qwen2_5_vl, image-text-to-text, RDT, rdt, RDT 2, Vision-Language-Action, Bimanual, Manipulation, Zero-shot, UMI, robotics, en, arxiv:2602.03310, base_model:Qwen/Qwen2.5-VL-7B-Instruct, base_model:finetune:Qwen/Qwen2.5-VL-7B-Instruct, license:apache-2.0, text-generation-inference, endpoints_compatible, region:us
Use RDT2-VQ on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
How It Runs on Mixpeek
On Mixpeek, RDT2-VQ runs as a managed extractor inside a processing pipeline. Point a bucket of robotics data at it, and Mixpeek handles GPU provisioning, batching, retries, and writing the outputs into a vector store you can query.
Extractor outputs land in the Mixpeek Vector Store (MVS), where you can combine them with retrieval, reranking, and filter stages to build end-to-end search and agent-perception pipelines with no model-serving infrastructure to maintain.
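To make the batching, retry, and store-writing steps described above concrete, here is a minimal, generic sketch in plain Python. It is not the Mixpeek API: `process_bucket`, `run_with_retries`, the `extract` callable, and the dict used as a stand-in for a vector store are all hypothetical names chosen for illustration, and the real service may work quite differently.

```python
import time
from typing import Callable, Dict, Iterator, List, Sequence


def batched(items: Sequence[str], size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def run_with_retries(
    fn: Callable[[Sequence[str]], List[object]],
    batch: Sequence[str],
    max_attempts: int = 3,
    backoff_s: float = 0.0,
) -> List[object]:
    """Call `fn(batch)`, retrying on any exception up to `max_attempts` times."""
    for attempt in range(1, max_attempts + 1):
        try:
            return fn(batch)
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            time.sleep(backoff_s * attempt)  # simple linear backoff
    raise RuntimeError("unreachable")


def process_bucket(
    object_keys: Sequence[str],
    extract: Callable[[Sequence[str]], List[object]],  # hypothetical extractor
    store: Dict[str, object],                          # stand-in for a vector store
    batch_size: int = 4,
    backoff_s: float = 0.0,
) -> Dict[str, object]:
    """Run `extract` over `object_keys` in batches and write outputs to `store`."""
    for batch in batched(object_keys, batch_size):
        outputs = run_with_retries(extract, batch, backoff_s=backoff_s)
        for key, output in zip(batch, outputs):
            store[key] = output
    return store
```

In this sketch a transient failure in one batch is retried without re-running batches that already succeeded, which is the general property a managed pipeline of this kind would be expected to provide.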
Specification
Organization: robotics-diffusion-transformer
Task: Robotics
Library: transformers
License: apache-2.0
Downloads/mo: 1K
Likes: 21
View on HuggingFace
See model card, files, and community discussion