import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
# ------------- Data loading section ------------
iris = load_iris()
# -----------------------------------------------
# ----------- Data preparation section ----------
# Feature matrix X and class-label vector y for the iris dataset.
X, y = iris.data, iris.target
# Hold out 40% of the samples as a test set; fixed seed for reproducibility.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.4, random_state=0
)
# -----------------------------------------------
# ---- Classifier parameter initialization ------
# Candidate neighbor counts: a fine sweep (1..9) and a coarse sweep (10..40, step 5).
k1_range = range(1, 10)
k2_range = range(10, 41, 5)
# you probably need to specify metric type here, e.g.
# metric_type = 'minkowski' and power, e.g. m_power = 2
# Note: the Minkowski metric with power 2 is the Euclidean metric.
# -----------------------------------------------
# ----- main computational block goes here ------
def knn_accuracy_sweep(k_values, X_tr, X_te, y_tr, y_te):
    """Fit a k-NN classifier for each k in *k_values*; return test accuracies.

    Uses the Minkowski metric with p=2, i.e. the Euclidean distance.

    Parameters: k_values -- iterable of neighbor counts to try;
    X_tr/y_tr -- training features/labels; X_te/y_te -- test features/labels.
    Returns: list of float accuracies, one per k, in iteration order.
    """
    scores = []
    for k in k_values:
        knn = KNeighborsClassifier(n_neighbors=k, metric='minkowski', p=2)
        knn.fit(X_tr, y_tr)
        # knn.score(X, y) is the mean accuracy on (X, y) — identical to
        # metrics.accuracy_score(y, knn.predict(X)); the original referenced
        # `metrics` without importing it, which raised NameError.
        scores.append(knn.score(X_te, y_te))
    return scores


# Accuracy for the fine sweep (k = 1..9) and the coarse sweep (k = 10..40, step 5).
scores1 = knn_accuracy_sweep(k1_range, X_train, X_test, y_train, y_test)
scores2 = knn_accuracy_sweep(k2_range, X_train, X_test, y_train, y_test)
# -----------------------------------------------
# ----------- plotting obtained results ---------
# Figure 1: accuracy vs. number of neighbors, fine sweep (k = 1..9).
plt.figure()
plt.plot(k1_range, scores1)
plt.yticks(np.arange(0.93, 0.98, 0.03))
# The x-axis label was missing on this figure (the second figure had one);
# added for consistency between the two plots.
plt.xlabel('Number of neighbors')
plt.ylabel('Accuracy')
# Figure 2: accuracy vs. number of neighbors, coarse sweep (k = 10..40, step 5).
plt.figure()
plt.plot(k2_range, scores2)
plt.yticks(np.arange(0.91, 0.98, 0.03))
plt.xlabel('Number of neighbors')
plt.ylabel('Accuracy')
# Blocks until the figure windows are closed.
plt.show()
# -----------------------------------------------