Skip to content

Commit

Permalink
Update variance-problem.mdx
Browse files Browse the repository at this point in the history
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method.
I think, it would be valuable to this course readers. So I'm adding it to the reading-list.

Thanks!
  • Loading branch information
BalajiAI authored Feb 17, 2024
1 parent 6ab84a4 commit 87fcfeb
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions units/en/unit6/variance-problem.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -27,4 +27,5 @@ However, increasing the batch size significantly **reduces sample efficiency**.
If you want to dive deeper into the question of variance and bias tradeoff in Deep Reinforcement Learning, you can check out these two articles:
- [Making Sense of the Bias / Variance Trade-off in (Deep) Reinforcement Learning](https://blog.mlreview.com/making-sense-of-the-bias-variance-trade-off-in-deep-reinforcement-learning-79cf1e83d565)
- [Bias-variance Tradeoff in Reinforcement Learning](https://www.endtoend.ai/blog/bias-variance-tradeoff-in-reinforcement-learning/)
- [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients)
---

0 comments on commit 87fcfeb

Please sign in to comment.