690dl/month
15likes
Identifier
Model ID
omlab/VLM-FO1_Qwen2.5-VL-3B-v01Tags
safetensorsomchat_qwen2_5_vlobject-detectionmultimodalRECVLMzero-shot-object-detectionzhenarxiv:2509.25916base_model:Qwen/Qwen2.5-VL-3B-Instructbase_model:finetune:Qwen/Qwen2.5-VL-3B-Instructlicense:apache-2.0region:us
Use VLM-FO1_Qwen2.5-VL-3B-v01 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval in Mixpeek Studio.
Open StudioSpecification
Organizationomlab
TaskObject Detection
Licenseapache-2.0
Downloads/mo690
Likes15
View on HuggingFace
See model card, files, and community discussion
Related Object Detection Models
microsoft/table-transformer-detection
1.8M
microsoft/table-transformer-structure-recognition
1.3M
hustvl/yolos-small
746K
PekingU/rtdetr_v2_r50vd
491K
PaddlePaddle/PP-DocLayoutV3_safetensors
348K
facebook/detr-resnet-50
300K
TahaDouaji/detr-doc-table-detection
199K
microsoft/table-transformer-structure-recognition-v1.1-all
175K