[D] Should you standardize your numerical features if the distribution is unknown or non-Gaussian?
I get the point of having your features on the same scale, but wouldn't min-max scaling, (x - xmin) / (xmax - xmin), work better? Every tutorial I've seen about data cleaning seems to use StandardScaler() without looking at the underlying data.
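To make the comparison concrete, here is a minimal sketch of the two transforms in plain Python. The helper names `standardize` and `min_max_scale` and the sample `feature` values are made up for illustration; as far as I understand, scikit-learn's StandardScaler and MinMaxScaler apply the same formulas column by column. Both are linear rescalings, so neither changes the shape of the distribution; they mainly differ in how an outlier affects the result.

```python
from statistics import mean, pstdev

def standardize(values):
    """z-score: (x - mean) / std, using the population standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(x - mu) / sigma for x in values]

def min_max_scale(values):
    """Min-max scaling: (x - xmin) / (xmax - xmin), mapping values into [0, 1]."""
    lo, hi = min(values), max(values)
    return [(x - lo) / (hi - lo) for x in values]

# Hypothetical skewed feature with a single large outlier.
feature = [1.0, 2.0, 3.0, 4.0, 100.0]

z = standardize(feature)
mm = min_max_scale(feature)

print("standardized:", [round(v, 3) for v in z])
print("min-max:     ", [round(v, 3) for v in mm])
```

With this sample, min-max scaling squeezes the four small values into a narrow band near 0 because the outlier defines the top of the range. Standardization is also pulled by the outlier through the mean and standard deviation, but it does not force the data into a fixed interval.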
submitted by /u/Trevahhhh