[D] Deep-Learning The “Hardest” Go Problem in the World
https://blog.janestreet.com/deep-learning-the-hardest-go-problem-in-the-world/
Here’s a recent post about one of our experiments using AlphaZero-like selfplay learning to explore a fun little microcosm within Go that prior Go bots, including superhumanly strong AlphaZero-based bots, completely fail at.
Obviously, this is not an attempt at tackling any grand open challenge, but hopefully is still a useful case study and that touches on what seem to be some of the remaining weaknesses of modern deep RL in games.
Hope you find it interesting!
submitted by /u/icosaplex
[link] [comments]