[P] SOTA Atari learning with Recurrent IQN
Hey Guys,
I’ve recently implemented a recurrent version of the IQN reinforcement learning algorithm, combining IQN/Rainbow/R2D2 features, which can reach state-of-the-art (In sample efficiency) results on the Atari benchmark.
Any feedback is more than welcome.
submitted by /u/olieber
[link] [comments]