Clipped Action Policy Gradient and PPO with Entropy Regularization applied to Continuous RL Problems
Implementations of Clipped Action Policy Gradient (CAPG) and on-policy PPO with entropy regularization, each following its respective paper, applied to OpenAI Gym continuous-control environments.
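As a rough illustration of the two ideas named above, here is a minimal per-sample sketch in plain Python. It assumes a one-dimensional Gaussian policy and an action range `[low, high]`. All function names, the `1e-12` floor, and the default coefficients `clip_eps=0.2` and `ent_coef=0.01` are hypothetical choices made for this example and are not taken from the code in this repository. The CAPG part replaces the Gaussian density with the tail probability mass whenever the sampled action falls outside the bounds, and the PPO part adds an entropy bonus to the clipped surrogate objective.

```python
import math


def normal_log_pdf(u, mu, sigma):
    """Log-density of N(mu, sigma^2) evaluated at u."""
    z = (u - mu) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2.0 * math.pi)


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of N(mu, sigma^2) at x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def capg_log_prob(u, mu, sigma, low, high):
    """Log-probability of the *clipped* action for an unclipped sample u.

    Outside the bounds, the density is replaced by the probability mass
    of the corresponding tail (the CAPG idea). The 1e-12 floor is an
    assumption added here to avoid log(0).
    """
    if u <= low:
        return math.log(max(normal_cdf(low, mu, sigma), 1e-12))
    if u >= high:
        return math.log(max(1.0 - normal_cdf(high, mu, sigma), 1e-12))
    return normal_log_pdf(u, mu, sigma)


def gaussian_entropy(sigma):
    """Differential entropy of a 1-D Gaussian with standard deviation sigma."""
    return 0.5 * math.log(2.0 * math.pi * math.e * sigma * sigma)


def ppo_entropy_loss(new_logp, old_logp, advantage, entropy,
                     clip_eps=0.2, ent_coef=0.01):
    """Per-sample PPO clipped surrogate loss with an entropy bonus.

    Returns a value to minimize: the negated clipped objective minus
    ent_coef times the policy entropy.
    """
    ratio = math.exp(new_logp - old_logp)
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    surrogate = min(ratio * advantage, clipped * advantage)
    return -(surrogate + ent_coef * entropy)
```

In a real training loop these quantities would be computed on batches with an automatic-differentiation library so that gradients flow through `mu` and `sigma`; the scalar version here only shows the arithmetic.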