[Research] Reinforcement Learning – Rainbow algorithm. Need some help with code

Hello good people!

Background : I need your help. First of all, I am out of my elements here. I am just learning about RL. I got a job on it luckily. It’s more code oriented but I need some concepts as well. I decided to throw myself in the water to break my stagnation. I hope you can help me here.

Issue : I want to run the code from the Rainbow paper. When I run it with default arguments it just keep running. I think by default it is set to run 5 million episodes(T-max = 50e6). I want to run one successful run before I start playing with it so I have an idea on what the result is supposed to look like. Should I just change the T-max variable? There are about 20 more arguments and I am not sure if it affects other or not. For example, I think the target-update is related to this. And since my concepts are not so clear, I could use some help here.

I hope I was clear, if not please ask me here.

Edit : spelling and stuff

submitted by /u/loser-two-point-o
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

[Research] Reinforcement Learning – Rainbow algorithm. Need some help with code