133dl/month
13likes
Identifier
Model ID
omlab/VLM-FO1_Qwen2.5-VL-3B-v01Tags
safetensorsomchat_qwen2_5_vlobject-detectionmultimodalRECVLMzero-shot-object-detectionzhenarxiv:2509.25916base_model:Qwen/Qwen2.5-VL-3B-Instructbase_model:finetune:Qwen/Qwen2.5-VL-3B-Instructlicense:apache-2.0region:us
Use VLM-FO1_Qwen2.5-VL-3B-v01 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
Organizationomlab
TaskObject Detection
Licenseapache-2.0
Downloads/mo133
Likes13
View on HuggingFace
See model card, files, and community discussion
Related Object Detection Models
microsoft/table-transformer-detection
3.5M
microsoft/table-transformer-structure-recognition-v1.1-all
1.4M
microsoft/table-transformer-structure-recognition
1.3M
hustvl/yolos-small
705K
valentinafevu/yolos-fashionpedia
566K
PaddlePaddle/PP-DocLayoutV3_safetensors
282K
facebook/detr-resnet-50
222K
TahaDouaji/detr-doc-table-detection
209K