[R] Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator
We recently released a preprint analyzing policy iteration on LQR. Here’s the arxiv paper: https://arxiv.org/abs/1905.12842
And a tl;dr explanation on Twitter: https://twitter.com/Krauth/status/1134630118930997249
submitted by /u/karilex
[link] [comments]