551dl/month
1likes
Identifier
Model ID
THU-KEG/LLaDA-8B-BGPO-countdownTags
safetensorslladareinforcement-learningplanningcountdowndllmbgpocustom_codeenarxiv:2510.11683license:apache-2.0region:us
Use LLaDA-8B-BGPO-countdown on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Open Pipeline BuilderSpecification
OrganizationTHU-KEG
TaskReinforcement Learning
Licenseapache-2.0
Downloads/mo551
Likes1
View on HuggingFace
See model card, files, and community discussion