[Research] Reinforcement Learning – Rainbow algorithm. Need some help with code
Hello good people!
Background : I need your help. First of all, I am out of my elements here. I am just learning about RL. I got a job on it luckily. It’s more code oriented but I need some concepts as well. I decided to throw myself in the water to break my stagnation. I hope you can help me here.
Issue : I want to run the code from the Rainbow paper. When I run it with default arguments it just keep running. I think by default it is set to run 5 million episodes(T-max
= 50e6). I want to run one successful run before I start playing with it so I have an idea on what the result is supposed to look like. Should I just change the T-max
variable? There are about 20 more arguments and I am not sure if it affects other or not. For example, I think the target-update
is related to this. And since my concepts are not so clear, I could use some help here.
I hope I was clear, if not please ask me here.
Edit : spelling and stuff
submitted by /u/loser-two-point-o
[link] [comments]