This repository contains a PyTorch implementation of Deep Q-Networks (DQN) with Hindsight Experience Replay (HER) for solving the bit-flip problem with 25 bits.
The number of bits is configurable; 25 bits or more is considered a somewhat difficult setting.
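
For readers unfamiliar with the task, here is a minimal sketch of a bit-flip environment. The class name `BitFlipEnv`, its method signatures, and the reward convention (-1 on every step until the state matches the goal, then 0, as in the HER paper) are assumptions for illustration and are not taken from this repository's code.

```python
import random


class BitFlipEnv:
    """Hypothetical bit-flip environment (illustrative, not the repository's class)."""

    def __init__(self, n_bits=25, seed=None):
        self.n_bits = n_bits
        self.rng = random.Random(seed)
        self.state = []
        self.goal = []

    def _random_bits(self):
        return [self.rng.randint(0, 1) for _ in range(self.n_bits)]

    def reset(self):
        """Sample a random start state and a different random goal."""
        self.state = self._random_bits()
        self.goal = self._random_bits()
        while self.goal == self.state:
            self.goal = self._random_bits()
        return list(self.state), list(self.goal)

    def step(self, action):
        """Flip the bit at index `action`; return (next_state, reward, done)."""
        self.state[action] ^= 1
        done = self.state == self.goal
        # Assumed sparse reward: 0 when the goal is reached, -1 otherwise.
        reward = 0.0 if done else -1.0
        return list(self.state), reward, done
```

With `n_bits` actions and a reward signal that only changes at the goal, random exploration rarely reaches the goal as `n_bits` grows, which is the difficulty that hindsight relabeling addresses.
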
- numpy == 1.18.2
- torch == 1.2.0
- n_bits: Number of bits
- lr: Learning rate
- gamma: Discount factor
- k_future: Number of episodes for the future hindsight strategy
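
To make the role of `k_future` concrete, the sketch below shows one common way to implement the "future" relabeling strategy. It reads `k_future` as the number of hindsight goals drawn per transition, which is how the HER paper defines k; the repository's code may interpret it differently. The function name `her_future_relabel`, the episode layout, and the reward convention are hypothetical.

```python
import random


def her_future_relabel(episode, k_future=4, rng=random):
    """Relabel an episode with goals achieved later in the same episode.

    episode: list of (state, action, next_state) tuples, states as tuples of bits.
    Returns a list of (state, action, reward, next_state, goal, done) tuples.
    """
    relabeled = []
    last = len(episode) - 1
    for t, (state, action, next_state) in enumerate(episode):
        for _ in range(k_future):
            # Pick a step from t to the end of the episode (both ends inclusive).
            future_t = rng.randint(t, last)
            # The state achieved at that step becomes the substitute goal.
            new_goal = episode[future_t][2]
            done = next_state == new_goal
            # Assumed sparse reward: 0 on reaching the goal, -1 otherwise.
            reward = 0.0 if done else -1.0
            relabeled.append((state, action, reward, next_state, new_goal, done))
    return relabeled
```

Each original transition yields `k_future` extra transitions, so with `k_future = 4` the replay memory would hold four hindsight samples for every real one under this reading.
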
Parameter | Value |
---|---|
n_bits | 25 |
lr | 1e-3 |
gamma | 0.98 |
memory_size | 1000000 |
batch_size | 128 |
k_future | 4 |
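
As an illustration of how these values might be grouped and where `gamma` enters, here is a small sketch. The `Config` dataclass and the `td_target` helper are hypothetical names; the target shown is the standard one-step Q-learning target, and whether this repository uses a target network or another variant is not stated here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Config:
    """Hyperparameters copied from the table above (hypothetical container)."""
    n_bits: int = 25
    lr: float = 1e-3
    gamma: float = 0.98
    memory_size: int = 1_000_000
    batch_size: int = 128
    k_future: int = 4


def td_target(reward, next_q_values, done, gamma):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a'), or r if terminal."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


cfg = Config()
# Example: a non-terminal step with reward -1 and next-state Q-values [0.5, 2.0].
target = td_target(-1.0, [0.5, 2.0], False, cfg.gamma)
```
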