Commit

Update README.md
wangjia184 authored Nov 12, 2022
1 parent cb80f60 commit 646e980
Showing 1 changed file with 2 additions and 4 deletions.
6 changes: 2 additions & 4 deletions README.md
@@ -32,8 +32,8 @@ The following graph shows the policy-value network from AlphaGo Zero.
It has been simplified here:

1. Number of residual blocks is reduced from 39 to 19.
-2. Residual block width is narrowed from 256 filters down to 32 filters.
-3. Since width is reduced to 1/8 and dying ReLU problem was encountered in first attempt, hence activation function ReLU is replaced with PReLU and ELU.
+2. Residual block width is narrowed from 256 filters down to 64 filters.
+3. Since width is reduced to 1/4 and dying ReLU problem was encountered in first attempt, hence activation function ReLU of last dense layer is replaced with ELU.
4. Input is simplified to a `(1, 4, 15, 15)` NCHW tensor with four 15x15 planes.
* The first plane represents stones of current player
* The second plane represents stones of opponent player
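
To illustrate the dying-ReLU motivation in item 3, here is a minimal NumPy sketch comparing the gradients of ReLU and ELU on negative pre-activations. It is not taken from the repository; the function names and `alpha=1.0` are assumptions chosen for illustration. ReLU's gradient is exactly zero for negative inputs, so a unit stuck there stops learning, while ELU keeps a small positive gradient.

```python
import numpy as np

def relu(x):
    # ReLU: max(x, 0)
    return np.maximum(x, 0.0)

def relu_grad(x):
    # Derivative is 1 for x > 0 and 0 otherwise
    return (x > 0).astype(x.dtype)

def elu(x, alpha=1.0):
    # ELU: x for x > 0, alpha * (exp(x) - 1) otherwise (alpha is an assumed default)
    return np.where(x > 0, x, alpha * np.expm1(np.minimum(x, 0.0)))

def elu_grad(x, alpha=1.0):
    # Derivative is 1 for x > 0 and alpha * exp(x) otherwise, so it never reaches zero
    return np.where(x > 0, 1.0, alpha * np.exp(np.minimum(x, 0.0)))

# Hypothetical pre-activations, mostly negative
x = np.array([-3.0, -1.0, -0.1, 0.5, 2.0])
print("ReLU gradient:", relu_grad(x))
print("ELU gradient: ", elu_grad(x))
```

With a narrow layer, there are fewer units to compensate when some of them go silent, which is one plausible reason the problem showed up after the width reduction.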
@@ -43,9 +43,7 @@ It has been simplified here:

$$ l = (z-v)^{2}-\pi ^{T}ln(p)+c\left \|\| \theta \right \|\|^{2} $$

Here comes the model graph

![Model](./model.png)
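
The loss formula in this hunk can be sketched in NumPy as follows. This is only an illustration under stated assumptions, not the repository's training code: the function name `alphazero_loss`, the regularization constant `c=1e-4`, and the `eps` guard against `log(0)` are all hypothetical choices. Here `z` is the game outcome, `v` the predicted value, `pi` the search probabilities, `p` the predicted move probabilities, and `theta` a list of weight arrays.

```python
import numpy as np

def alphazero_loss(z, v, pi, p, theta, c=1e-4, eps=1e-12):
    # Value term: squared error between the outcome z and the prediction v
    value_loss = (z - v) ** 2
    # Policy term: cross-entropy -pi^T ln(p); eps (an assumption) avoids log(0)
    policy_loss = -np.dot(pi, np.log(p + eps))
    # Regularization term: c * ||theta||^2 summed over all weight arrays
    l2_penalty = c * sum(np.sum(w ** 2) for w in theta)
    return value_loss + policy_loss + l2_penalty

# Hypothetical example on a 15x15 board (225 possible moves)
n_moves = 15 * 15
pi = np.full(n_moves, 1.0 / n_moves)   # uniform search probabilities
p = np.full(n_moves, 1.0 / n_moves)    # uniform network prediction
theta = [np.ones((2, 2))]              # toy stand-in for the network weights
loss = alphazero_loss(z=1.0, v=0.5, pi=pi, p=p, theta=theta)
print(loss)
```

In a real training loop the three terms would typically be computed per sample and averaged over a batch, with the L2 term often handled by the optimizer's weight decay instead.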


