[D] Self-Tuning Networks
I read this paper a while ago and got super excited about it, but I haven't seen it implemented or talked about as much as I expected. Any ideas why?
Link to the paper: https://arxiv.org/abs/1903.03088
submitted by /u/El__Professor