Thread: Research work - help with binary data (/thread-37650.html)
Python Forum (https://python-forum.io), Forum: Homework
Research work - help with binary data - Newhedgehog6309 - Jul-05-2022

Hello everyone,

I would like to use Python to analyze this table (example), but I don't know which technique would be best for my problem.

Each line corresponds to an observation (450k lines). Each one has a date, a step, and one or two events (0 = No, 1 = Yes). I have 7 steps that repeat each year (like seasons) over 4 years, and 8 different events.

My problem: I would like a method to calculate the distribution/hierarchy of each event per step, and then per year. I don't know how to do it. What kind of algorithms would be suitable? My dependent variable would be "steps". My independent variables are the events.

I hope I have been clear enough about my problem. It is for my research paper. Thank you.

RE: Research work - help with binary data - jefsummers - Jul-06-2022

Not sure I fully understand where you are going (I would have thought the independent variable would be the date, but it looks like you are more interested in grouping). I recommend using Pandas, and in particular looking at the GroupBy family of functions. See HERE
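To make the GroupBy suggestion concrete, here is a minimal sketch of how the counting could look. The original table is not shown in the thread, so the column names (`date`, `step`, `event_a`, `event_b`) and the toy values are assumptions made purely for illustration; the real data would have 8 event columns and about 450k rows. "Hierarchy" is interpreted here as a simple ranking of events by how often they occur, which may or may not be what the poster had in mind.

```python
import pandas as pd

# Hypothetical toy data: column names and values are assumptions,
# not taken from the poster's actual table.
df = pd.DataFrame({
    "date": pd.to_datetime([
        "2019-01-10", "2019-01-15", "2019-06-01",
        "2020-01-12", "2020-06-03", "2020-06-20",
    ]),
    "step": [1, 1, 2, 1, 2, 2],
    "event_a": [1, 0, 1, 1, 0, 0],   # 0 = No, 1 = Yes
    "event_b": [0, 1, 1, 0, 0, 1],
})
event_cols = ["event_a", "event_b"]

# Derive the year from the date so it can be used as a grouping key.
df["year"] = df["date"].dt.year

# Number of occurrences of each event within each step.
counts_by_step = df.groupby("step")[event_cols].sum()

# Share of each event among all event occurrences in that step
# (each row sums to 1; a step with no events at all would give NaN).
share_by_step = counts_by_step.div(counts_by_step.sum(axis=1), axis=0)

# Rank events within each step: 1 = most frequent.
rank_by_step = counts_by_step.rank(axis=1, ascending=False, method="min")

# Same counts, broken down per year and then per step.
counts_by_year_step = df.groupby(["year", "step"])[event_cols].sum()

print(counts_by_step)
print(share_by_step)
print(rank_by_step)
print(counts_by_year_step)
```

Because the events are coded 0/1, summing a column inside each group gives the number of times that event occurred, and replacing `.sum()` with `.mean()` would give the fraction of observations in the group where it occurred. Whether counts, shares, or ranks are the right summary depends on the research question.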