Oct-15-2019, 02:22 AM
Hi All,
Every now and then I get asked to create a dataset with n number of specified biases, I normally do this is SQL manually with a lot of calculation in Excel. I was wondering if someone has not already built something like this in Python.
Example: have a dataset with lets say 1M records of demographic information like Age, Gender, Race and Country. So the request will look like : Hi JP, please send me a sample dataset that represents the below biases.
70% Female 30% Male
20% Black 80% Rest
18% 18-25 25% 26-35 and 57% rest
50% USA and 10% Africa and 40% Russia
There should be something like this already created? If you know of this please point me in the right direction.
Every now and then I get asked to create a dataset with n number of specified biases, I normally do this is SQL manually with a lot of calculation in Excel. I was wondering if someone has not already built something like this in Python.
Example: have a dataset with lets say 1M records of demographic information like Age, Gender, Race and Country. So the request will look like : Hi JP, please send me a sample dataset that represents the below biases.
70% Female 30% Male
20% Black 80% Rest
18% 18-25 25% 26-35 and 57% rest
50% USA and 10% Africa and 40% Russia
There should be something like this already created? If you know of this please point me in the right direction.