[D] Trying to understand the proof in “Weight Uncertainty in Neural Networks” by DeepMind
Can someone please explain this proof from one of DeepMind’s papers (https://arxiv.org/pdf/1505.05424.pdf)? I am having difficulty understanding where the second term in the expectation comes from. submitted by /u/brandinho77