Self-study and implementations of deep reinforcement learning papers/algorithms with a friend.
The following algorithms can be found in the repo:
- Tabular Q-Learning
- Deep Q-Learning
[Paper: Playing Atari with Deep Reinforcement Learning] - REINFORCE (Vanilla Policy Gradient with Monte Carlo returns)
- Advantage Actor Critic (A2C)
[Paper: Asynchronous Methods for Deep Reinforcement Learning] - Proximal Policy Optimization (PPO)
[Paper: Proximal Policy Optimization Algorithms] - Deep Deterministic Policy Gradients (DDPG)
[Paper: Continuous control with deep reinforcement learning] - Dynamics Randomization for RL Transfer Learning
[Paper: Sim-to-Real Transfer of Robotic Control with Dynamics Randomization]
- Install dependencies using
pip3 install -r requirements.txt - Each script has
trainandtestmethods. To call them, dopython3 <script_name> <method_name>. For example:python3 REINFORCE.py train - The
testmethod will load a model from themodelsdirectory. Pre-trained models for some algorithms can be found in this repo.