NEWAgents can now see video via MCP.Try it now →
    Models/Image Text To Text/stepfun-ai/step3
    Image Text To Texttransformersapache-2.0

    step3

    by stepfun-ai

    161Kdl/month
    166likes
    Identifier
    Model ID
    stepfun-ai/step3

    Tags

    transformerssafetensorsstep3_vltext-generationimage-text-to-textconversationalcustom_codearxiv:2507.19427license:apache-2.0endpoints_compatibleregion:us

    Use step3 on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder