783dl/month
6likes
Identifier
Model ID
JonnyYu828/DepthVLM-4BTags
transformerssafetensorsqwen3_vlimage-text-to-textvision-language-modeldepth-estimation3d-visionmultimodalenarxiv:2605.15876base_model:Qwen/Qwen3-VL-4B-Instructbase_model:finetune:Qwen/Qwen3-VL-4B-Instructlicense:apache-2.0endpoints_compatibleregion:us
Use DepthVLM-4B on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
OrganizationJonnyYu828
TaskDepth Estimation
Librarytransformers
Licenseapache-2.0
Downloads/mo783
Likes6
View on HuggingFace
See model card, files, and community discussion