Python Forum
Research work - help with binary data



Research work - help with binary data - Newhedgehog6309 - Jul-05-2022

Hello everyone,

I would like to use Python to analyze this table (example), but I don't know which technique would be best for my problem.

Each line corresponds to an observation (450k lines in total). Each one has a date, a step, and one or two events (0 = No, 1 = Yes). There are 7 steps that repeat each year (like seasons) over 4 years, and 8 different events.

My problem: I would like a method to calculate the distribution/hierarchy of each event by step, and then by year.

I don't know how to do it. What kind of algorithms would be interesting?

My dependent variable would be "steps".

My independent variables are the events.

I hope I have explained my problem clearly enough. It is for my research paper. Thank you.


RE: Research work - help with binary data - jefsummers - Jul-06-2022

Not sure I fully understand where you are going (I would have thought the independent variable would be the date, but it looks like you are more interested in grouping). I recommend using Pandas and looking at its GroupBy family of functions. See HERE
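
To make the GroupBy suggestion concrete, here is a minimal sketch of how it could look. The column names (date, step, event_a, event_b), the helper event_distribution, and the sample rows are all made up for illustration, since the original table was not posted as text. It assumes one row per observation, one 0/1 column per event, and it interprets "hierarchy" as a simple ranking of events by count within each group, which may not be what the research question needs.

```python
import pandas as pd


def event_distribution(df, event_cols, date_col="date", step_col="step"):
    """Summarize 0/1 event columns per (year, step).

    Hypothetical helper: the column names are assumptions, not the
    poster's real schema.
    """
    out = df.copy()
    # Derive the year from the date so that steps can be compared across years.
    out["year"] = pd.to_datetime(out[date_col]).dt.year

    grouped = out.groupby(["year", step_col])[event_cols]
    counts = grouped.sum()    # number of observations where each event == 1
    shares = grouped.mean()   # fraction of observations where each event == 1
    # Rank events inside each (year, step) group: 1 = most frequent.
    ranks = counts.rank(axis=1, ascending=False, method="min")
    return counts, shares, ranks


# Tiny made-up sample, only to show the shape of the input.
sample = pd.DataFrame({
    "date": ["2018-01-10", "2018-01-20", "2018-03-05",
             "2019-01-15", "2019-01-18", "2019-03-02"],
    "step": [1, 1, 2, 1, 1, 2],
    "event_a": [1, 0, 1, 1, 1, 0],
    "event_b": [0, 1, 1, 0, 0, 1],
})

counts, shares, ranks = event_distribution(sample, ["event_a", "event_b"])

# Collapse the step level to get totals per year only.
yearly_counts = counts.groupby(level="year").sum()

print(counts)
print(shares)
print(ranks)
print(yearly_counts)
```

With the real data, event_cols would be the list of the 8 event columns. The resulting tables are indexed by (year, step), so they can be inspected directly or passed on to a plot or a statistical test.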