NEWAgents can now see video via MCP.Try it now →
    Models/Reinforcement Learning/THU-KEG/LLaDA-8B-BGPO-countdown

    LLaDA-8B-BGPO-countdown

    by THU-KEG

    Identifier
    Model ID
    THU-KEG/LLaDA-8B-BGPO-countdown

    Tags

    safetensorslladareinforcement-learningplanningcountdowndllmbgpocustom_codeenarxiv:2510.11683license:apache-2.0region:us

    Use LLaDA-8B-BGPO-countdown on Mixpeek

    Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.

    Open Pipeline Builder