[Discussion] What is the status of the “Information Bottleneck Theory of Deep Learning”?
I am aware of the recent ICLR paper which tried to debunk some of the key claims in the general case. But the IB theory authors came back with a (rude) rebuttal on OpenReview with new experiments to show that it holds in the general case. I could not understand how valid they were from the author’s response to it.
The theory is complex with a lot of moving parts. I will be spending a lot of time on this if I go ahead and I also imagine there are few more people in similar position. Before that I wanted to check here if anyone relatively more experienced had a critical review of it (however brief). Is IB theory a promising or misdirected approach?