    vit_base_patch16_224.dinov3-mlxim

    by mlx-vision

    Identifier
    Model ID
    mlx-vision/vit_base_patch16_224.dinov3-mlxim

    Tags

    mlx-image, safetensors, mlx, vision, dinov3, image-feature-extraction, arxiv:2010.11929, arxiv:2508.10104, license:other, region:us

    Use vit_base_patch16_224.dinov3-mlxim on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
