Timeline for When is unbalanced data really a problem in Machine Learning?
Current License: CC BY-SA 4.0
5 events
| when | what | | by | license | comment |
|---|---|---|---|---|---|
| Sep 22, 2021 at 21:03 | comment | added | Gabe Verzino | | That's awesome! Nice work. |
| Sep 22, 2021 at 18:06 | comment | added | Raqib | | Great article @Gabe Verzino. I found the pitfalls you mentioned to be spot on for the classification problem I was working on. My data has an imbalance of 4:1, and balancing the data affected the performance when the model was supplied with real-world data. I had a fair amount of data, 400k samples for the majority class and 100k for the minority class. For my use case, adding more data was better for generalization than balancing the data. |
| Jul 31, 2021 at 22:11 | review | Late answers (Jul 31, 2021 at 22:14) | | | |
| Jul 31, 2021 at 21:57 | review | First posts (Aug 1, 2021 at 0:03) | | | |
| Jul 31, 2021 at 21:53 | history | answered | Gabe Verzino | CC BY-SA 4.0 | |