Reinforcement Learning

Self-study implementations of deep reinforcement learning papers and algorithms, written with a friend.

The following algorithms can be found in the repo:

Tabular Q-Learning
Deep Q-Learning
    [Paper: Playing Atari with Deep Reinforcement Learning]
REINFORCE (Vanilla Policy Gradient with Monte Carlo returns)
Advantage Actor Critic (A2C)
    [Paper: Asynchronous Methods for Deep Reinforcement Learning]
Proximal Policy Optimization (PPO)
    [Paper: Proximal Policy Optimization Algorithms]
Deep Deterministic Policy Gradients (DDPG)
    [Paper: Continuous control with deep reinforcement learning]
Dynamics Randomization for RL Transfer Learning
    [Paper: Sim-to-Real Transfer of Robotic Control with Dynamics Randomization]
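
To give a feel for the simplest algorithm in the list, here is a minimal sketch of a tabular Q-learning update with epsilon-greedy action selection. It is not taken from Q_Learning.py: the function names, the list-of-lists table layout, and the default values of alpha and gamma are all assumptions made for illustration.

```python
import random


def q_learning_update(q_table, state, action, reward, next_state, done,
                      alpha=0.1, gamma=0.99):
    """Apply one Q-learning update in place and return the TD error.

    q_table is assumed to be a list of rows, one row of action values
    per discrete state (a hypothetical layout, not the repo's).
    """
    # Bootstrap from the greedy value of the next state unless the episode ended.
    target = reward if done else reward + gamma * max(q_table[next_state])
    td_error = target - q_table[state][action]
    q_table[state][action] += alpha * td_error
    return td_error


def epsilon_greedy(q_table, state, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_table[state]))
    values = q_table[state]
    return max(range(len(values)), key=values.__getitem__)
```

In a training loop, one would call epsilon_greedy to choose an action, step the environment, and then pass the observed transition to q_learning_update.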

Install dependencies with pip3 install -r requirements.txt
Each script exposes train and test methods. To run one, use python3 <script_name> <method_name>, for example: python3 REINFORCE.py train
The test method loads a model from the models directory. Pre-trained models for some algorithms are included in this repo.
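
The train/test convention could be wired up roughly as follows. This is a hypothetical sketch, not the code used by the scripts in this repo: the placeholder bodies of train and test, the METHODS table, and the dispatch helper are all assumptions.

```python
def train():
    """Placeholder: train the agent and save it under the models directory."""
    return "training"


def test():
    """Placeholder: load a saved model from the models directory and evaluate it."""
    return "testing"


# Map the method name given on the command line to the function to run.
METHODS = {"train": train, "test": test}


def dispatch(argv):
    """Run the method named by the first command-line argument."""
    if len(argv) != 2 or argv[1] not in METHODS:
        raise SystemExit(f"usage: python3 {argv[0]} <train|test>")
    return METHODS[argv[1]]()
```

A script following this pattern would import sys and call dispatch(sys.argv) at the bottom, so that python3 <script_name> train runs train() and any other argument prints the usage message.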

