C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
reinforcement-learning ucb multi-armed-bandits softmax bandit-algorithms softmax-policy bernoulli-bandit gaussian-bandit
Updated Apr 7, 2021 - C++
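
The topic tags name two action-selection policies, UCB and softmax. As a rough illustration of what such policies compute, here is a minimal sketch. It is not taken from the repository: the function names `ucb1_select` and `softmax_probs`, their signatures, the UCB1 bonus sqrt(2 ln t / n), and the temperature parameter `tau` are all assumptions chosen for the example.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not from the repository): UCB1 arm selection.
// means[a]  = empirical mean reward of arm a
// counts[a] = number of times arm a has been pulled
// t         = total number of pulls so far (t >= 1 once any arm was pulled)
std::size_t ucb1_select(const std::vector<double>& means,
                        const std::vector<int>& counts,
                        int t) {
    std::size_t best = 0;
    double best_score = 0.0;
    for (std::size_t a = 0; a < means.size(); ++a) {
        if (counts[a] == 0) return a;  // try every arm once before comparing
        // Empirical mean plus an exploration bonus that shrinks as the arm is pulled more.
        double bonus = std::sqrt(2.0 * std::log(static_cast<double>(t)) / counts[a]);
        double score = means[a] + bonus;
        if (a == 0 || score > best_score) {
            best = a;
            best_score = score;
        }
    }
    return best;
}

// Hypothetical helper (not from the repository): softmax (Boltzmann) probabilities
// over estimated arm values q, with temperature tau > 0 (an assumed parameter).
std::vector<double> softmax_probs(const std::vector<double>& q, double tau) {
    std::vector<double> p(q.size());
    if (q.empty()) return p;
    // Subtract the maximum before exponentiating for numerical stability.
    double q_max = *std::max_element(q.begin(), q.end());
    double sum = 0.0;
    for (std::size_t a = 0; a < q.size(); ++a) {
        p[a] = std::exp((q[a] - q_max) / tau);
        sum += p[a];
    }
    for (double& x : p) x /= sum;
    return p;
}
```

Both helpers work only on reward estimates, so the same sketch applies whether rewards are drawn from Bernoulli or Gaussian arms. To sample an arm from the softmax probabilities, one option is to pass them to `std::discrete_distribution` from `<random>`.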