Python Forum
Create dataset with (n) known biases
#1
Hi All,

Every now and then I get asked to create a dataset with n specified biases. I normally do this manually in SQL, with a lot of calculation in Excel. I was wondering whether someone has already built something like this in Python.

Example: take a dataset with, let's say, 1M records of demographic information such as Age, Gender, Race and Country. The request will look like: "Hi JP, please send me a sample dataset that represents the biases below."

70% Female, 30% Male
20% Black, 80% rest
18% aged 18-25, 25% aged 26-35, 57% rest
50% USA, 10% Africa, 40% Russia

Surely something like this has already been created? If you know of one, please point me in the right direction.
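
For illustration, here is a minimal sketch of how a request like the one above could be expressed in Python with NumPy and pandas. It is not an existing library: the `make_biased_dataset` helper, the `TARGETS` dictionary, and the column and category names are all hypothetical. It also assumes the attributes are independent of each other and that synthetic rows are acceptable, which may not match the real use case.

```python
import numpy as np
import pandas as pd

# Target shares per attribute, copied from the example request.
# Column names and category labels are illustrative assumptions.
TARGETS = {
    "Gender": {"Female": 0.70, "Male": 0.30},
    "Race": {"Black": 0.20, "Rest": 0.80},
    "AgeBand": {"18-25": 0.18, "26-35": 0.25, "Rest": 0.57},
    "Country": {"USA": 0.50, "Africa": 0.10, "Russia": 0.40},
}


def make_biased_dataset(n_rows, targets, seed=0):
    """Hypothetical helper: build n_rows synthetic records whose
    per-column shares match `targets` (up to rounding).
    Each column is filled independently of the others."""
    rng = np.random.default_rng(seed)
    columns = {}
    for name, shares in targets.items():
        labels = list(shares)
        probs = np.array([shares[label] for label in labels], dtype=float)
        if not np.isclose(probs.sum(), 1.0):
            raise ValueError(f"shares for {name!r} must sum to 1")
        # Fixed quota per label; any rounding drift goes to the largest share.
        counts = np.rint(probs * n_rows).astype(int)
        counts[np.argmax(probs)] += n_rows - counts.sum()
        # Lay out the quotas, then shuffle so the row order is random.
        values = np.repeat(labels, counts)
        rng.shuffle(values)
        columns[name] = values
    return pd.DataFrame(columns)


sample = make_biased_dataset(10_000, TARGETS)
print(sample["Gender"].value_counts(normalize=True))
```

Using fixed quotas rather than random draws means each column hits its target share exactly instead of only on average. To draw from an existing table rather than synthesise rows, pandas' `DataFrame.sample` accepts a `weights` argument, although matching several marginal targets at once generally requires fitting those weights first, for example by iterative proportional fitting.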