Skip to main content
NoName's user avatar
NoName's user avatar
NoName's user avatar
NoName
  • Member for 6 years
  • Last seen more than 3 years ago
comment
Why apply a 50:50 train test split?
Only if the true distribution does not change significantly through time. e.g. time series data. You don't want to be validating today's economics with economics from the 1950s.
comment
comment
Merging large CSV files in pandas
But pandas holds its DataFrames in memory, would you really have enough RAM for large data sets?
awarded
comment
Do data scientists use Excel?
People without a business background don't use excel. Period. And considering business graduates don't usually go into data science, you can understand the ignorance.
comment
Do data scientists use Excel?
Reading comprehension guys. "We are interviewing for a data scientist position...", meaning he's not the employer. It is a group intervew. He's just judging the other candidate being interviewed.