Chapter wise implementation & analysis of all the algorithms in RL : An Intoduction by Richard S. Sutton and Andrew G. Barto
-
Updated
Jul 18, 2020 - Jupyter Notebook
Chapter wise implementation & analysis of all the algorithms in RL : An Intoduction by Richard S. Sutton and Andrew G. Barto
Implementation of some of the policy gradient methods in PyTorch.
Analysis of Bandit Algorithms on the Bernoulli Bandit Problem
This repository has been created just for warm-up in reinforcement learning and there are my simulation files of UT-RL course HWs.
Add a description, image, and links to the gradient-bandit topic page so that developers can more easily learn about it.
To associate your repository with the gradient-bandit topic, visit your repo's landing page and select "manage topics."