Add a new training mode, "both", that trains with PPO and SAC simultaneously, each on separate instances. For example, with 4 environments, the even-numbered ones use PPO and the odd-numbered ones use SAC. Because the environments are split evenly between the two algorithms, num_envs must be an even number. #6233
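The even/odd split described above can be sketched roughly as follows. This is only an illustration, not the code from this pull request: the helper name `split_envs_by_algorithm` is hypothetical, and it assumes environments are numbered from 0 so that index 0 counts as even.

```python
def split_envs_by_algorithm(num_envs: int) -> dict[str, list[int]]:
    """Assign environment indices to PPO (even) and SAC (odd).

    Hypothetical helper for illustration; assumes 0-based indices.
    """
    # An odd count would leave one algorithm with an extra environment,
    # so reject it up front.
    if num_envs <= 0 or num_envs % 2 != 0:
        raise ValueError(f"num_envs must be a positive even number, got {num_envs}")
    return {
        "PPO": list(range(0, num_envs, 2)),  # even indices: 0, 2, ...
        "SAC": list(range(1, num_envs, 2)),  # odd indices: 1, 3, ...
    }


if __name__ == "__main__":
    print(split_envs_by_algorithm(4))
```

With `num_envs=4`, each algorithm receives two environments; an odd value such as 3 raises `ValueError` instead of producing an uneven split.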
Cycode Security / Cycode: Secrets: succeeded on Sep 13, 2025 in 0s. No secrets were found in this pull request.