[R] Acting without Rewards
Here is our latest blog post. It is an “aside” from our regular demos – we have two new ones in the works, but we thought it would be interesting to share some research we did in the meantime.
Link the the post: https://ogma.ai/2019/08/acting-without-rewards/
The post talks about unsupervised behavior learning (UBL), a method for having an agent learn from every interaction with its environment. This method is similar in purpose to hindsight experience replay (HER), but functions very differently and offers different advantages.
Let us know what you think!