Join our meetup, learn, connect, share, and get to know your Toronto AI community.
Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.
Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.
Hello good people!
Background : I need your help. First of all, I am out of my elements here. I am just learning about RL. I got a job on it luckily. It’s more code oriented but I need some concepts as well. I decided to throw myself in the water to break my stagnation. I hope you can help me here.
Issue : I want to run the code from the Rainbow paper. When I run it with default arguments it just keep running. I think by default it is set to run 5 million episodes(T-max
= 50e6). I want to run one successful run before I start playing with it so I have an idea on what the result is supposed to look like. Should I just change the T-max
variable? There are about 20 more arguments and I am not sure if it affects other or not. For example, I think the target-update
is related to this. And since my concepts are not so clear, I could use some help here.
I hope I was clear, if not please ask me here.
Edit : spelling and stuff
submitted by /u/loser-two-point-o
[link] [comments]