Python Forum
How to summarize survey requirement data?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to summarize survey requirement data?
#1
Hi All,
I’m currently trying to find out an algorithmic approach to find the most in demand profile group for a set of survey data, this is example of the dataset I’m working with:

Survey |Demand |Age |Gender
----------------------------------------
1 |5 |[18,19,20] |[1,2]
2 |10 |[19,20,21] |[1,2]
3 |5 |[20,21,22] |[1,2]

• There are 3 surveys to be done
• Survey 1 needs 5 people to complete it, and the requirements are that they are aged 18,19 or 20 and male/female

So, from visually looking at the data the most in demand profile is people aged 20, followed by those aged 19 or 21.

However, I need to be able to run this on a larger dataset, so I’m looking for an algorithmic approach to find the same conclusion as above.
What I’ve tried so far is:
• Creating unique combinations of all possible answers and counting demand by combination – this unfortunately doesn’t scale very well as surveys can have multiple questions with 30+ options, so the number of combinations becomes unusable.
• I’ve thought about using a decision tree, which given that it uses a gini index, could be very useful to determine which questions are the most significant drivers of demand. However, I’ve been unable to find a way to fit the model given that my dataset stores a range of possible options.

Other assumptions to consider:
• The example above is a simplification with only 2 questions, there are many more questions in the dataset
• Some will have a larger range of answers (e.g., 30+)
• Not all surveys would have the same questions (i.e., some will ask more/less questions than others)

Any help, thoughts or general guidance would be greatly appreciated!
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Neural network and data analysis from clients survey result pthon3 2 1,916 Mar-17-2022, 02:21 AM
Last Post: jefsummers
  Can python read Marathi text files and summarize them? mcp111 0 1,842 Mar-18-2020, 08:58 AM
Last Post: mcp111
  complex survey data analysis abeshkc 1 2,816 Nov-06-2019, 06:14 AM
Last Post: ThomasL

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020