#

gradient-bandit

Here are 5 public repositories matching this topic...

SanketAgrawal / ReinforcementLearning

Chapter wise implementation & analysis of all the algorithms in RL : An Intoduction by Richard S. Sutton and Andrew G. Barto

reinforcement-learning artificial-intelligence epsilon-greedy python-3 ucb k-armed-bandit gradient-bandit optimistic-inital-values

Updated Jul 18, 2020
Jupyter Notebook

hritikb / Reinforcement-Learning-Algorithms

reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy

Updated Jun 29, 2023
Jupyter Notebook

MehranTaghian / policy-gradient-methods

Implementation of some of the policy gradient methods in PyTorch.

pytorch policy-gradient reinforce actor-critic ppo online-supervised-learning gradient-bandit batch-reinforce

Updated Jul 27, 2022
Python

ma-nadeau / BanditAlgorithms

Analysis of Bandit Algorithms on the Bernoulli Bandit Problem

reinforcement-learning thompson-sampling epsilon-greedy bandit-algorithms gradient-bandit

Updated Mar 10, 2025
Jupyter Notebook

Taabannn / intro-rl

This repository has been created just for warm-up in reinforcement learning and there are my simulation files of UT-RL course HWs.

reinforcement-learning monte-carlo q-learning statistical-inference dqn epsilon-greedy ddpg policy-iteration value-iteration q-learning-vs-sarsa sarsa-algorithm gradient-bandit ucb-algorithm

Updated Dec 8, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the gradient-bandit topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gradient-bandit topic, visit your repo's landing page and select "manage topics."