Python Forum
Found input variables with inconsistent numbers of samples: [1000, 200] - Printable Version

+- Python Forum (https://python-forum.io)
+-- Forum: Python Coding (https://python-forum.io/forum-7.html)
+--- Forum: General Coding Help (https://python-forum.io/forum-8.html)
+--- Thread: Found input variables with inconsistent numbers of samples: [1000, 200] (/thread-34913.html)



Found input variables with inconsistent numbers of samples: [1000, 200] - jenya56 - Sep-14-2021

Hi all,
I have this code that compiles all the way to the end. Once the function returns it goes though some accuracy calculations that are not set by me and hence I cannot really debug it. Though it might be something in my code.
        
        # Encoding categorical data in y
        labelencoder_y = LabelEncoder()
        y = labelencoder_y.fit_transform(y)

        self.X_train, self.X_test, self.y_train, self.y_test = train_test_split(X, y, test_size=0.2, random_state = 40)

        sc = StandardScaler()
        self.X_train = sc.fit_transform(self.X_train)
        self.X_test = sc.transform(self.X_test)
         

        # Fit to the training data
        self.clf.fit(self.X_train, self.y_train)
        
        y_pred = self.clf.predict(self.X_test)
        print(accuracy_score(self.y_test, y_pred))
        result = np.array(y_pred, dtype=bool)
        #resulti = resulti.reshape((200,1))
        result
        print("result")
        print(result.shape)
        print('ypred')
        print(y_pred.shape)
        print('y_train')
        print(self.y_train.shape)
        print('X_train')
        print(self.X_train.shape)
        print("WE ARE DONE")
return result
result shape is
(200,)
ypred shape is
(200,)
y_train shape is
(800,)
X_train shape is
(800, 61)
WE ARE DONE
And the code throws the error as Found input variables with inconsistent numbers of samples: [1000, 200] where my result variable is of shape (200,). Though my datataframe is of size 1000x13. Any suggestions? THanks
Jenya


RE: Found input variables with inconsistent numbers of samples: [1000, 200] - ibreeden - Sep-15-2021

Hi Jenya56, you are not making it easy for us to understand the problem. First: if there is an error message you should post the complete message (in error tags). Second: the code you show us appears to generate no errors, but you mention vaguely there are accuracy calculations running after your code. Apperantly the error comes from those calculations.
You mention:
(Sep-14-2021, 09:11 PM)jenya56 Wrote: Once the function returns
... but there is no function in your code, and then of course no return statement. My guess is you should write this as a function, returning values to the accuracy calculations.


RE: Found input variables with inconsistent numbers of samples: [1000, 200] - jenya56 - Sep-15-2021

(Sep-15-2021, 09:48 AM)ibreeden Wrote: Hi Jenya56, you are not making it easy for us to understand the problem. First: if there is an error message you should post the complete message (in error tags). Second: the code you show us appears to generate no errors, but you mention vaguely there are accuracy calculations running after your code. Apperantly the error comes from those calculations.
You mention:
(Sep-14-2021, 09:11 PM)jenya56 Wrote: Once the function returns
... but there is no function in your code, and then of course no return statement. My guess is you should write this as a function, returning values to the accuracy calculations.

Hi! I actually found the problem. For reference, I did not have to do splitting of data. My y_pred was not equal to y (shape wise) and was complaining about that! THANK YOU