Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[P] Rainbow-IQN, that reaches the perfect score (+21) on Atari Pong within 100 episodes!

Hi, ML redditors all around the world! Recently, I have been studying Reinforcement Learning methods and implementing some of the ideas that could possibly improve my work. Not long ago I took a look at Implicit Quantile Networks for Distributional Reinforcement Learning (Dabney et al., 2018), and one sentence at the end caught my attention.

“Creating a Rainbow-IQN agent could yield even greater improvements on Atari-57.”

So I tried it. As the figure attached in the project readme, it learns Atari Pong incredibly faster than Rainbow as it reaches the perfect score (+21) within just 100 episodes. I am so happy to share this result, even though it is not so enough to evaluate the method’s performance objectively.

Plus, the repository contains other methods as well: A2C, PPO, DDPG, TD3, SAC, from Demonstration, Behaviour Cloning, and so on.

Any feedback will be so appreciated! Thanks 🙂

submitted by /u/Curt-Park
[link] [comments]