[R] Stupid question about training/test data to check feature importance.
Hi, I’m fairly new to this and planning on using random forests to try and see which variables most affect the outcome of another variable.
In this case, I can’t exactly see what the training data would be used for.
I don’t really understand what splitting my data would achieve, can someone explain?
submitted by /u/Frogad
[link] [comments]