You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/release-notes/next-release.md
+1Lines changed: 1 addition & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,3 +6,4 @@
6
6
* Allow episode metrics during eval to be normalized by the episode length, as long as the metric name ends with "per_step".
7
7
* Add adaptive learning rate to PPO. Desired KL is sensitive to network initialization weights and entropy cost and may require some tuning for your environment.
8
8
* Add loss metrics to the PPO training logger.
9
+
* Add `donate_argnums` to brax PPO to somewhat mitigate repeated graph captures when using MJX-Warp.
0 commit comments