Python Forum
How to test likelihood hypothesis on dataset?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to test likelihood hypothesis on dataset?
#1
How to test the following hypothesis? Customers with larger fares are more likely to be travailing alone than smaller ones.
Using the data below, is a common statistical test appropriate for this hypothesis?

Given were comparing different segments(high vs low) of fare, surely tests using means aren't helpful? Thus is there an alternative to statistical test package for validating this hypothesis.

Common Statistical tests
> T-test: compare two groups/categories of numeric variables with small sample size​
1. one sample t-test: test the mean of one group against a constant value​
2. two sample t-test: test the difference of means between two groups​
3. paired sample t-test: test the difference of means between two measurements of the same subject​
> Z-test: compare two groups/categories of numeric variables with large sample size​
> ANOVA test: compare the difference between two or more groups/categories of numeric variables​
> Chi-Squared test: examine the relationship between two categorical variables​
> Correlation test: examine the relationship between two numeric variable​

# package
import seaborn as sns

#load dataset
df= sns.load_dataset('titanic')
df = df[['fare','alone']]

#dataset
fare alone
0 7.2500 False
1 71.2833 False
2 7.9250 True
3 53.1000 False
4 8.0500 True
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Partitioning when splitting data into train and test-dataset Den0st 0 1,989 Dec-07-2019, 08:31 PM
Last Post: Den0st
  Join Predicted values with test dataset bhuwan 4 10,425 Mar-28-2019, 12:42 AM
Last Post: bhuwan

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020