Self-study and implementations of deep reinforcement learning papers/algorithms with a friend.
The following algorithms can be found in the repo:
- Tabular Q-Learning
- Deep Q-Learning
[Paper: Playing Atari with Deep Reinforcement Learning] - REINFORCE (Vanilla Policy Gradient with Monte Carlo returns)
- Advantage Actor Critic (A2C)
[Paper: Asynchronous Methods for Deep Reinforcement Learning] - Proximal Policy Optimization (PPO)
[Paper: Proximal Policy Optimization Algorithms] - Deep Deterministic Policy Gradients (DDPG)
[Paper: Continuous control with deep reinforcement learning]
- Install dependencies using
pip3 install -r requirements.txt
- Each script has
train
andtest
methods. To call them, dopython3 <script_name> <method_name>
. For example:python3 REINFORCE.py train
- The
test
method will load a model from themodels
directory. Pre-trained models for some algorithms can be found in this repo.