Python Forum
Multivariate Analysis of Ecology Data
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Multivariate Analysis of Ecology Data
#1
I'm writing code to do statistical analysis of ecological data as part of a voluntary research project. Where the independent variables are continuous I'm planning to do multivariate analysis (LDA or PCA analysis) on it using Python libraries - where the independent variables are categorical I'm not sure of the best techniques to use - I was wondering if anyone else has experience of this or could provide any advice?

The biologists and zoologists I'm working with are experts in their field but don't have any background in statistics or computing - the other ecologists and zoologists I know tend to use R but my background is more in .Net and Python.

Given the power of the scientific data analysis libraries in Python - particularly pandas, numpy, scipy and scikit-learn and the graphical display libraries - particularly matplotlib, seaborn and plotnine - and of supporting technologies like pandas, Jupyter Notebook and JupyterLab - I was planning to do the analysis in Python.

In the area we're looking at there aren't many a priori theories which can be expressed as equations so I think the analysis will be primarily exploratory in nature with a lot of the results displayed graphically.

For multivariate analysis in Python I found this:

https://github.com/gatsoulis/a_little_bo...ysis.ipynb

it was written using older versions of the libraries and didn't work using the current versions but I amended it to work with current versions. An expert in Python in molecular biology and genomics however pointed out that this tutorial manually codes calculations which more modern versions of pandas and scikit-learn can do out-of the box.

I haven't been able to find many other resources on multivariate analysis in Python unfortunately.

It would be great to get input and some pointers from other people using Python in ecology and zoology! python Smile
Reply
#2
See: https://pypi.org/search/?q=multivariate+analysis
I haven't used any of these packages, so you will have to explore/test those that look useful for your project.
Reply
#3
Not sure I understand completely, but scikit-learn does multiple regression analysis. I used it for an analysis of factors in (medical residency) rank order looking at factors such as location of medical school, test scores, sex, age, strength of interview, etc. Is that close to the problem? https://www.geeksforgeeks.org/multiple-l...kit-learn/ is an instructional post. Happy to help if this will work.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Neural network and data analysis from clients survey result pthon3 2 1,914 Mar-17-2022, 02:21 AM
Last Post: jefsummers
  Easy analysis of Data ranjjeetk 1 1,927 Jun-06-2020, 01:44 AM
Last Post: Larz60+
  Basic data analysis and predictions mates 15 6,950 Mar-14-2020, 09:06 PM
Last Post: mates
  complex survey data analysis abeshkc 1 2,816 Nov-06-2019, 06:14 AM
Last Post: ThomasL

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020