[D] Why re-sampling imbalanced data isn’t always the best idea

Written by torontoai on November 1, 2019. Posted in Reddit MachineLearning.

I often work with people (on medical studies) who have a huge “knowledge” of statistical methods but lack the required basics or an understanding of what goes on inside some of the algorithms. That’s perfectly fine; after all, that’s not their job but mine.

But over time, I’ve come across a few problems where some really basic over-sampling was applied because the “needed significance” had not been found. I’ve thrown together a really simple example that anyone should be able to follow (without any deep statistical knowledge) to showcase what can happen. Maybe it helps you, or you can use it to make your own case:

https://stroemer.cc/resample-imbalanced-data/
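
As a rough illustration of the kind of problem described above (this is not the example from the linked post, and every number in it is made up), the sketch below runs a pooled two-proportion z-test on a hypothetical imbalanced study, then repeats it after naively over-sampling the small group by duplicating each patient 20 times. The event rates stay exactly the same, but the test treats the copies as new, independent observations, so the p-value shrinks. The function name `two_proportion_p_value` and the group sizes are assumptions chosen for illustration.

```python
import math


def two_proportion_p_value(events_a, n_a, events_b, n_b):
    """Two-sided p-value of a pooled two-proportion z-test."""
    pooled = (events_a + events_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (events_a / n_a - events_b / n_b) / std_err
    # For a standard normal, 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical study: large control group, small treatment group.
control_events, control_n = 100, 1000   # 10% event rate
treated_events, treated_n = 8, 50       # 16% event rate

p_original = two_proportion_p_value(
    control_events, control_n, treated_events, treated_n
)

# Naive over-sampling: copy every treated patient 20 times to "balance"
# the groups. No new information is added, only duplicates.
copies = 20
p_oversampled = two_proportion_p_value(
    control_events, control_n, treated_events * copies, treated_n * copies
)

print(f"p-value on the original data:     {p_original:.4f}")
print(f"p-value after naive oversampling: {p_oversampled:.6f}")
```

With these made-up numbers the original comparison is not significant at the 5% level, while the duplicated data is, even though nothing about the underlying patients has changed. Over-sampling schemes that generate synthetic points instead of exact copies differ in detail, but they also add no independent observations.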

submitted by /u/kchnkrml