Timeline for Pandas Groupby operation on a large dataset
Current License: CC BY-SA 4.0
17 events
| when toggle format | what | by | license | comment | |
|---|---|---|---|---|---|
| Apr 28, 2020 at 18:00 | history | tweeted | twitter.com/StackCodeReview/status/1255195077342830593 | ||
| Apr 26, 2020 at 16:08 | history | edited | Jamal | CC BY-SA 4.0 | deleted 30 characters in body; edited title |
| Apr 26, 2020 at 5:42 | answer | added | peteron30 | timeline score: 1 | |
| Apr 25, 2020 at 18:42 | answer | added | Juho | timeline score: 3 | |
| Apr 25, 2020 at 17:33 | comment | added | peteron30 | Let us continue this discussion in chat. | |
| Apr 25, 2020 at 17:33 | history | edited | peteron30 | CC BY-SA 4.0 | deleted 1 character in body |
| Apr 25, 2020 at 17:28 | comment | added | Juho | Running your code with the mentioned arguments gives me KeyError: 'Only a column name can be used for the key in a dtype mappings argument.' | |
| Apr 25, 2020 at 17:26 | comment | added | peteron30 | @Juho use python script.py 1947 1948 to run for 1947 only. It will take <7 minutes on your end, since I removed some lines. The last groupby is what I am trying to optimize. | |
| Apr 25, 2020 at 17:19 | comment | added | peteron30 | @Juho sorry about that. code fixed. should work now. | |
| Apr 25, 2020 at 17:16 | history | edited | peteron30 | CC BY-SA 4.0 | deleted 546 characters in body |
| Apr 25, 2020 at 16:57 | history | edited | peteron30 | CC BY-SA 4.0 | added 89 characters in body |
| Apr 25, 2020 at 16:52 | comment | added | peteron30 | @Juho sample file is provided. It takes ~7 minutes to process this file. drive.google.com/file/d/1hJN7dYYpcG73PSJOB8zukAR14FnfZt_e/… | |
| Apr 25, 2020 at 16:50 | history | edited | peteron30 | CC BY-SA 4.0 | deleted 17 characters in body |
| Apr 22, 2020 at 15:50 | comment | added | RootTwo | Have you considered using other libraries besides or in addition to pandas, like Dask? It has parallel versions of .groupby and can be set up to use multiple cores or computers. | |
| Apr 20, 2020 at 20:49 | comment | added | Mast♦ | Welcome to Code Review! Please state a bit more about the goal of your program, perhaps read the FAQ on asking questions to get the most out of your question. | |
| Apr 20, 2020 at 20:35 | review | First posts | |||
| Apr 20, 2020 at 21:11 | |||||
| Apr 20, 2020 at 20:30 | history | asked | peteron30 | CC BY-SA 4.0 |