| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README.md | 2022-06-07 | 5.7 kB | |
| v1.0.0b1 CleanRL Beta Release source code.tar.gz | 2022-06-07 | 33.5 MB | |
| v1.0.0b1 CleanRL Beta Release source code.zip | 2022-06-07 | 33.6 MB | |
| Totals: 3 Items | 67.2 MB | 0 | |
🎉 I am thrilled to announce the v1.0.0b1 CleanRL Beta Release. CleanRL has come a long way making high-quality deep reinforcement learning implementations easy to understand. In this release, we have put a huge effort into revamping our documentation site, making our implementation friendly to use for new users.
I would like to cordially thank the core dev members @dosssman @yooceii @Dipamc77 @bragajj for their efforts in helping maintain the CleanRL repository. I would also like to give a shout-out to our new contributors @ElliotMunro200 and @Dipamc77.
New CleanRL supported publications
- Huang, S., Dossa, R., Raffin, A., Kanervisto, A., & Wang, W. (2022). The 37 Implementation Details of Proximal Policy Optimization, International Conference on Learning Representations 2022 Blog Post Track
- Huang, S., Ontañón, S., (2022). A Closer Look at Invalid Action Masking in Policy Gradient Algorithms, The International FLAIRS Conference Proceedings, 35.
- Schmidt, D., & Schmied, T. (2021). Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari Deep Reinforcement Learning Workshop at the 35th Conference on Neural Information Processing Systems
New algorithm variants
- Match PPG implementation by @Dipamc77 in https://github.com/vwxyzjn/cleanrl/pull/186
- See the documentation here: https://docs.cleanrl.dev/rl-algorithms/ppg/
- Proper multi-gpu support with PPO by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/178
- See the documentation here: https://docs.cleanrl.dev/rl-algorithms/ppo/#ppo_atari_multigpupy
- Support Pettingzoo Multi-agent Atari envs with PPO by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/188
- See the documentation here: https://docs.cleanrl.dev/rl-algorithms/ppo/#ppo_pettingzoo_ma_ataripy
Refactoring changes
- Let
ppo_continuous_action.pyonly run 1M steps by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/161 - Change
ppo.py's default timesteps by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/164 - Enable video recording for
ppo_procgen.pyby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/166 - Refactor replay based scripts by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/173
Documentation changes
A significant amount of documentation changes (tracked by https://github.com/vwxyzjn/cleanrl/issues/121).
See the overview documentation page here: https://docs.cleanrl.dev/rl-algorithms/overview/
- Add
ddpg_continuous_action.pydocs by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/137 - Fix DDPG docs' description by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/139
- Fix typo in DDPG docs by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/140
- Fix incorrect links in the DDPG docs by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/142
- DDPG documnetation tweaks; added Q loss equations and light explanation by @dosssman in https://github.com/vwxyzjn/cleanrl/pull/145
- Add
dqn_atari.pydocumentation by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/124 - Add documentation for
td3_continuous_action.pyby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/141 - SAC Documentation - Benchmarks - Minor code tweaks by @dosssman in https://github.com/vwxyzjn/cleanrl/pull/146
- Add docs for
c51.pyandc51_atari.pyby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/159 - Add docs for
dqn.pyby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/157 - Address stale documentation by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/169
- Documentation improvement - fix links and mkdocs by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/181
- Improve documentation and contribution guide by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/189
- Fix documentation links in README.md by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/192
- Fix the implemented varaints section in PPO by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/193
Misclanouse changes
- Add Pull Request template by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/122
- Amend license to give proper attribution by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/152
- Introduce better contribution guide by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/154
- Fix the default wandb project name in
ppo_atari_envpool.pyby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/160 - Removes unmaintained scripts by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/170
- Add PPO documentation by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/163
- Add docs header by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/174
- Update README.md by @ElliotMunro200 in https://github.com/vwxyzjn/cleanrl/pull/177
- Update issue_template.md by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/180
- Temporarily Remove PPO-RND by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/190
Utility changes
- Export
requirements.txtautomatically by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/143 - Auto-upgrade syntax via
pyupgradeby @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/158 - Introduce benchmark utilities by @vwxyzjn in https://github.com/vwxyzjn/cleanrl/pull/165
New Contributors
- @ElliotMunro200 made their first contribution in https://github.com/vwxyzjn/cleanrl/pull/177
- @Dipamc77 made their first contribution in https://github.com/vwxyzjn/cleanrl/pull/186
Full Changelog: https://github.com/vwxyzjn/cleanrl/compare/v0.6.0...v1.0.0b1