NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/docling-project/SmolDocling-256M-preview
    Image Text To Texttransformerscdla-permissive-2.0

    SmolDocling-256M-preview

    by docling-project

    38Kdl/month
    1,614likes
    Identifier
    Model ID
    docling-project/SmolDocling-256M-preview

    Tags

    transformersonnxsafetensorsidefics3image-text-to-textconversationalendataset:ds4sd/SynthCodeNetdataset:ds4sd/SynthFormulaNetdataset:ds4sd/SynthChartNetdataset:HuggingFaceM4/DoclingMatixarxiv:2503.11576arxiv:2305.03393base_model:HuggingFaceTB/SmolVLM-256M-Instructbase_model:quantized:HuggingFaceTB/SmolVLM-256M-Instructlicense:cdla-permissive-2.0endpoints_compatibleregion:us

    Use SmolDocling-256M-preview on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder