[D] Natural Gradient Descent/ Hessian/ Fisher papers
I am reading about Natural gradient descent. Usually I find it quite helpful if I can get to read quite a bunch of papers concerning the topic, I am figuring out good papers online, but if someone from the community can point me to some good papers, it would be great.
Can some one point me to a list of good natural gradient descent papers which would address the following:
- Starting from the S Amari’s paper to the recent papers.
- Natural gradient descent fisher approximation papers
- Natural gradient descent papers which highlight benefits in preventing saddle points/ achieving local minima.
- Hessian matrix approximation papers
- Other good papers covering techniques like KFAC, hessian matrices, fisher matrices, their approximations, NSGD