[D] How to select your data points when doing stochastic optimization?
Have people settled on anything better than the uniform that’s commonly used? Hopefully something unbiased & minimum variance. I’m finding a few papers 1, 2 and thought to check here before trying to implement any of them.
If it’s relevant, my data has 2e14 points and I can afford to do 150M of them.