2Kdl/month
10likes
Identifier
Model ID
DAMO-NLP-SG/VL3-SigLIP-NaViTTags
transformerssafetensorsvideollama3_vision_encoderfeature-extractionvisual-encodermulti-modal-large-language-modelimage-feature-extractioncustom_codeenarxiv:2501.13106arxiv:2406.07476arxiv:2306.02858base_model:google/siglip-so400m-patch14-384base_model:finetune:google/siglip-so400m-patch14-384license:apache-2.0region:us
Use VL3-SigLIP-NaViT on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
OrganizationDAMO-NLP-SG
TaskImage Feature Extraction
Librarytransformers
Licenseapache-2.0
Downloads/mo2K
Likes10
View on HuggingFace
See model card, files, and community discussion