(TP + FN)

These metrics help determine how well the model classifies diabetes-positive and -negative cases.

What are the steps to compute confusion matrix, accuracy, precision, and recall for a KNN model applied on the diabetes dataset ?

Train/Test Split: Divide the dataset into training (e.g., 80%) and test (20%) sets.
Model Training: Fit the KNeighborsClassifier on the training set using scikit-learn.
Prediction: Predict outcomes on the test set.

Compute Metrics:

from sklearn.metrics import confusion_matrix, accuracy_score, precision_score, recall_score

# Example values
y_true = [0, 1, 1, 0, 1]
y_pred = [0, 1, 0, 0, 1]

conf_matrix = confusion_matrix(y_true, y_pred)
accuracy = accuracy_score(y_true, y_pred)
precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)

These metrics will give insights into model performance and help compare with other classifiers.

Programming Code:

Following code write in: ML_P05.py

# ML Project Program 05 

# K-Nearest Neighbors Algorithm on diabetes.csv dataset
import pandas as pd
import numpy as np

data = pd.read_csv("./diabetes_dataset/diabetes.csv")
data
data.info()
data.describe()
data.columns
# Checking null values

data.isnull().sum()
# create variables
data_x = data.drop(columns = "Outcome", axis=1)
data_y = data['Outcome']
data.shape
data_x.shape , data_y.shape
from sklearn.preprocessing import StandardScaler
scale = StandardScaler()
scaledX = scale.fit_transform(data_x)

# split into Train & Test 
from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(scaledX, data_y, test_size = 0.2,)
# Machine Learning Model - KNN
from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors = 7)

knn.fit(x_train, y_train)
y_pred = knn.predict(x_test)
from sklearn import metrics

# Confusion Matrix

cs = metrics.confusion_matrix(y_test, y_pred)

print("Confusion Matrix is : \n", cs)
# Accuracy score

ac = metrics.accuracy_score(y_test, y_pred)

print("Accuracy score is : ", ac)                # Model Accuracy is 69%
# Error Rate

er = 1 - ac

print("Error rate is : ", er)           # Error Rate is : 0.305
# Precision

p = metrics.precision_score(y_test, y_pred)

print("Precision: ", p)
#  Recall

r = metrics.recall_score(y_test, y_pred)

print("Recall: ", r)
# Precision score is: 0.607            &
# Recall score is: 0.534
# Thanks for Watching

# Thanks For Reading.

Output:

Machine Learning Program / Project - 05

Posted by go2collage

Post a Comment

0 Comments

Search This Blog

Most Popular

C# Error 02: The process cannot access the file because it is being used by another process in C#

Flutter Error 02: Failed to load FirebaseOptions from resource Check that you have defined ...

Console App Task 23: How to print a table name along with its contents using Console.WriteLine in C#

Featured Post

Machine Learning Program / Project - 08

Program / Project Code

Pages

Footer Menu Widget

Contact form

Ad Code