May-24-2018, 11:49 AM
You are obviously running out of memory.
You probably should find a way to split your data into chunks and process it in smaller portions - or increase the amount of available RAM
pandas
is a memory hog - see this article. Quoting the authorQuote:my rule of thumb for pandas is that you should have 5 to 10 times as much RAM as the size of your dataset
You probably should find a way to split your data into chunks and process it in smaller portions - or increase the amount of available RAM
Test everything in a Python shell (iPython, Azure Notebook, etc.)
- Someone gave you an advice you liked? Test it - maybe the advice was actually bad.
- Someone gave you an advice you think is bad? Test it before arguing - maybe it was good.
- You posted a claim that something you did not test works? Be prepared to eat your hat.