Python Forum
Basic data analysis and predictions
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Basic data analysis and predictions
#15
Spent some time on this tonight. Note that in my version I read the csv differently, from a different source (my Google drive). It still has errors, but the plot at the end shows the modeled values - next step is to plot the actual vs the modeled, and do stats on them if you want
# Load libraries
from pandas import read_csv
from pandas.plotting import scatter_matrix
from matplotlib import pyplot
from sklearn.model_selection import train_test_split
from sklearn.model_selection import cross_val_score
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import classification_report
from sklearn.metrics import confusion_matrix
from sklearn.metrics import accuracy_score
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
import pandas as pd
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression

# Load dataset
from google.colab import drive
drive.mount('/content/drive')
filename = (r'/content/drive/My Drive/analyza_casovych_radov.csv')
cols = ['Rok', 'Pocet prepravenych cestujucich', ]
dataset = pd.read_csv(filename, names=cols)
df = dataset

trainval_dataset = df.sample(frac=0.8,random_state=42)
test_dataset = df.drop(trainval_dataset.index)
train_dataset = trainval_dataset.sample(frac=0.8, random_state=42)
validate_dataset = trainval_dataset.drop(train_dataset.index)
print ()
print(f"Train {train_dataset.shape} Validate {validate_dataset.shape} Test {test_dataset.shape}")

print ()
print ()

print ('train_dataset= ')  
print (train_dataset)

print ()
print ('test_dataset= ')  
print (test_dataset)

print ()
print ('validate_dataset= ')
print (validate_dataset)

print()

X = train_dataset['Rok']
y = train_dataset['Pocet prepravenych cestujucich']


poly = PolynomialFeatures(2)

X_poly = poly.fit_transform(X.to_frame().values.reshape(-1, 1))
poly.fit(X_poly, y) 
lin2 = LinearRegression() 
lin2.fit(X_poly, y) 

plt.scatter(X.values, y, color = 'blue') 
  
plt.plot(X.values, lin2.predict(poly.fit_transform(X_poly)), color = 'red') 
plt.title('Polynomial Regression') 
plt.xlabel('Rok') 
plt.ylabel('Other') 
  
plt.show()
#print (poly.fit_transform(X))

#plt.scatter(X, y, color = 'blue')
#plt.plot(X, (poly.fit_transform(X)), color = 'red')
#plt.show()
Output:
--------------------------------------------------------------------------- ValueError Traceback (most recent call last) <ipython-input-20-b5736948e378> in <module>() 64 plt.scatter(X.values, y, color = 'blue') 65 ---> 66 plt.plot(X.values, lin2.predict(poly.fit_transform(X_poly)), color = 'red') 67 plt.title('Polynomial Regression') 68 plt.xlabel('Rok') 2 frames /usr/local/lib/python3.6/dist-packages/sklearn/utils/extmath.py in safe_sparse_dot(a, b, dense_output) 149 ret = np.dot(a, b) 150 else: --> 151 ret = a @ b 152 153 if (sparse.issparse(a) and sparse.issparse(b) ValueError: matmul: Input operand 1 has a mismatch in its core dimension 0, with gufunc signature (n?,k),(k,m?)->(n?,m?) (size 3 is different from 10)
Reply


Messages In This Thread
Basic data analysis and predictions - by mates - Mar-06-2020, 11:57 PM
RE: Basic data analysis and predictions - by mates - Mar-07-2020, 07:08 AM
RE: Basic data analysis and predictions - by mates - Mar-07-2020, 11:29 PM
RE: Basic data analysis and predictions - by mates - Mar-08-2020, 09:49 AM
RE: Basic data analysis and predictions - by mates - Mar-08-2020, 04:42 PM
RE: Basic data analysis and predictions - by mates - Mar-08-2020, 06:19 PM
RE: Basic data analysis and predictions - by mates - Mar-10-2020, 02:15 PM
RE: Basic data analysis and predictions - by mates - Mar-11-2020, 09:01 PM
RE: Basic data analysis and predictions - by jefsummers - Mar-12-2020, 12:46 AM
RE: Basic data analysis and predictions - by mates - Mar-14-2020, 09:06 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Multivariate Analysis of Ecology Data Will_Robertson 2 956 Aug-04-2023, 11:19 AM
Last Post: jefsummers
  Neural network and data analysis from clients survey result pthon3 2 1,930 Mar-17-2022, 02:21 AM
Last Post: jefsummers
  HELP- DATA FRAME INTO TIME SERIES- BASIC bntayfur 0 1,778 Jul-11-2020, 09:04 PM
Last Post: bntayfur
  How to save predictions made by an autoencoder Glasgow1988 0 1,586 Jul-03-2020, 12:43 PM
Last Post: Glasgow1988
  Easy analysis of Data ranjjeetk 1 1,934 Jun-06-2020, 01:44 AM
Last Post: Larz60+
  Utilize input predictions for Supervised Learning donnertrud 2 1,917 May-20-2020, 12:45 PM
Last Post: donnertrud
  complex survey data analysis abeshkc 1 2,831 Nov-06-2019, 06:14 AM
Last Post: ThomasL
  Merge Predictions with whole data set mayanksrivastava 0 3,620 Jun-29-2017, 11:39 AM
Last Post: mayanksrivastava

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020