Python Forum

Full Version: Sentiment Analysis Classifier
You're currently viewing a stripped down version of our content. View the full version with proper formatting.

sentence = [sentence.__str__()]
classifier =LogisticRegression()
vectorizer = CountVectorizer()
x, y =data,sentiment

x_train, x_test,y_train, y_test = train_test_split(X,y)
sentiment_pipeline =Pipeline([('vectorizer',vectorizer),('classifier',classifier)])
sentiment_pipeline, confusion_matrix = cross_val_score(sentiment_pipeline, x_train, x_test )

My goal is to find the sentiment of the variable named 'sentence'. I get: ValueError: Found input variables with inconsistent numbers of samples: [1, 3] as an error. Any help would be greatly appreciated.

Imports concerning this is as follows:
from sklearn.feature_extraction.text import CountVectorizer
from nltk.sentiment import SentimentAnalyzer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.model_selection import cross_val_score