    Image Feature Extraction, transformers, apache-2.0

    VL3-SigLIP-NaViT

    by DAMO-NLP-SG

    Identifier
    Model ID
    DAMO-NLP-SG/VL3-SigLIP-NaViT

    Tags

    transformers, safetensors, videollama3_vision_encoder, feature-extraction, visual-encoder, multi-modal-large-language-model, image-feature-extraction, custom_code, en, arxiv:2501.13106, arxiv:2406.07476, arxiv:2306.02858, base_model:google/siglip-so400m-patch14-384, base_model:finetune:google/siglip-so400m-patch14-384, license:apache-2.0, region:us

    Use VL3-SigLIP-NaViT on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
