[D] Any paper about inner-update for different layers?
So as we know for MAML-like meta learning, many works inner-update the top layers while meta-update the backbone. Is there any paper showing the results about the difference for inner-updatting different number of top layers?
submitted by /u/ARXrean
[link] [comments]