[Research] Reinforcement Learning – Rainbow algorithm. Need some help with code

Hello good people!

Background: I need your help. First of all, I am out of my element here. I am just learning about RL, and I was lucky enough to get a job working on it. The job is more code-oriented, but I need to understand some of the concepts as well. I decided to throw myself into the water to break my stagnation. I hope you can help me here.

Issue: I want to run the code from the Rainbow paper. When I run it with the default arguments, it just keeps running. I think by default it is set to run for 50 million episodes (T-max = 50e6). I want to get one successful run before I start playing with it, so I have an idea of what the result is supposed to look like. Should I just change the T-max variable? There are about 20 more arguments, and I am not sure whether changing it affects the others. For example, I think target-update is related to it. Since my grasp of the concepts is still shaky, I could use some help here.
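To make the T-max / target-update question concrete, here is a minimal sketch of the sanity check one might do before shortening a run. It is not taken from the Rainbow code: the function names are hypothetical, the numbers are placeholders rather than the script's defaults, and it assumes that T-max and target-update are counted in the same unit and that the target network is synced once every target-update steps.

```python
def target_updates_in_run(t_max: int, target_update: int) -> int:
    """Count how many target-network syncs fit into a run of t_max steps.

    Assumption: both values use the same unit, and a sync happens
    once every `target_update` steps.
    """
    if t_max <= 0 or target_update <= 0:
        raise ValueError("t_max and target_update must be positive")
    return t_max // target_update


def check_short_run(t_max: int, target_update: int) -> str:
    """Return a one-line summary of whether a shortened run still makes sense."""
    syncs = target_updates_in_run(t_max, target_update)
    if syncs == 0:
        return (f"T-max={t_max} is shorter than target-update={target_update}: "
                "the target network would never be synced.")
    return f"T-max={t_max} allows {syncs} target-network syncs."


if __name__ == "__main__":
    # Placeholder numbers for illustration only, not the script's defaults.
    print(check_short_run(t_max=100_000, target_update=8_000))
    print(check_short_run(t_max=5_000, target_update=8_000))
```

If those assumptions hold, lowering T-max alone should be fine for a first test run as long as it stays well above any interval-style argument such as target-update; the same check would apply to any other argument that is expressed as a number of steps.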

I hope I was clear; if not, please ask me here.

Edit: spelling and stuff

submitted by /u/loser-two-point-o

Toronto AI is a social and collaborative hub to unite AI innovators of Toronto and surrounding areas. We explore AI technologies in digital art and music, healthcare, marketing, fintech, vr, robotics and more. Toronto AI was founded by Dave MacDonald and Patrick O'Mara.