[D] On the Difficulty of Evaluating Baselines: A Study on Recommender Systems
Here is the paper: https://arxiv.org/abs/1905.01395. I read it recently and found it quite interesting. It points out some issues with how baselines are evaluated in recommender systems research. In my view, the key claim is that we need standardized benchmarks and that the whole community should converge on well-calibrated results. I couldn’t find any discussion of it here, so I’m creating this post and look forward to hearing your thoughts.