[Research] Learning Finite State Representations of Recurrent Policy Networks | Deep Reinforcement Learning | Playing Pong with 3 states

Written by torontoai on April 27, 2019. Posted in Reddit MachineLearning.

Hello Everyone,

I'd like to share the code for our recent paper, “Learning Finite State Representations of Recurrent Policy Networks”.

Abstract: Recurrent neural networks (RNNs) are an effective representation of control policies for a wide range of reinforcement and imitation learning problems. RNN policies, however, are particularly difficult to explain, understand, and analyze due to their use of continuous-valued memory vectors and observation features. In this paper, we introduce a new technique, Quantized Bottleneck Insertion, to learn finite representations of these vectors and features. The result is a quantized representation of the RNN that can be analyzed to improve our understanding of memory use and general behavior. We present results of this approach on synthetic environments and six Atari games. The resulting finite representations are surprisingly small in some cases, using as few as 3 discrete memory states and 10 observations for a perfect Pong policy. We also show that these finite policy representations lead to improved interpretability.
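To give a feel for what "finite representations of these vectors" means, here is a minimal sketch, not taken from the linked repository. It assumes a three-level (-1, 0, +1) quantization of a tanh-squashed latent vector; the function names `ternary_quantize` and `count_discrete_states` and the sample values are hypothetical, chosen only for illustration. The point it shows is that once continuous memory vectors are rounded to a small set of codes, the distinct codes can be enumerated and treated as the states of a finite state machine.

```python
import numpy as np


def ternary_quantize(latent):
    """Squash a latent vector with tanh, then round each entry to -1, 0, or +1.

    Assumption: three quantization levels; this is an illustrative choice,
    not necessarily the scheme used in the paper's code.
    """
    squashed = np.tanh(np.asarray(latent, dtype=float))
    return tuple(int(v) for v in np.rint(squashed))


def count_discrete_states(latents):
    """Return the set of distinct quantized codes seen across a batch of latents."""
    return {ternary_quantize(v) for v in latents}


# Hypothetical continuous memory vectors collected from a policy rollout.
memory_vectors = [
    [2.0, -2.0, 0.1],
    [3.0, -1.5, 0.0],
    [-2.0, 2.0, 0.2],
]
states = count_discrete_states(memory_vectors)
print(len(states), "distinct discrete states:", sorted(states))
```

The first two vectors differ as continuous values but collapse to the same code, which is the sense in which quantization can shrink a continuous memory into a handful of states.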

Paper: https://openreview.net/forum?id=S1gOpsCctm

Source Code: https://github.com/koulanurag/mmn

Hopefully it will be helpful to others.

submitted by /u/HeavyStatus4