378 downloads/month
Identifier
Model ID
Sudhish-Poojary/ppo-LunarLander-v3
Tags
stable-baselines3, LunarLander-v3, deep-reinforcement-learning, reinforcement-learning, model-index, region:us
Use ppo-LunarLander-v3 on Mixpeek
Build multimodal processing pipelines with this model and others. Extract features, run inference, and set up retrieval, all through the Mixpeek pipeline builder.
Specification
Organization: Sudhish-Poojary
Task: Reinforcement Learning
Library: stable-baselines3
Downloads/mo: 378