Skip to main content

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

2
  • $\begingroup$ Great article @Gabe Verzino. I found the pitfalls you mentioned to be spot on for the classification problem I was working on. My data has an imbalance of 4:1, and balancing the data affected the performance when the model was supplied with real-world data. I had a fair amount of data, 400k samples for the majority class and 100k for the minority class. For my use case, adding more data was better for generalization than balancing the data. $\endgroup$ Commented Sep 22, 2021 at 18:06
  • $\begingroup$ That's awesome! Nice work. $\endgroup$ Commented Sep 22, 2021 at 21:03