Skip to content

Analyzed, Cleaned and Visualised dataset of different patients. Built a ML-Model using Python and Machine Learning that predicts whether a person is diabetic or not

Notifications You must be signed in to change notification settings

Neet2202/Diabetes-Prediction-Model-Using-Multiple-Algorithms

Repository files navigation

Diabetes-Prediction-Model-

  • Analyzed, Cleaned and Visualised dataset of different patients.
  • Built a ML-Model using Python and Machine Learning that predicts whether a person is diabetic or not

Confusion Matrix

  • A Confusion matrix is an N x N matrix used for evaluating the performance of a classification model, where N is the number of target classes.
  • The matrix compares the actual target values with those predicted by the machine learning model. image

Decision Tree

  • Decision Tree is a Supervised learning technique that can be used for both classification and Regression problems, but mostly it is preferred for solving Classification problems.
  • It is a tree-structured classifier, where internal nodes represent the features of a dataset, branches represent the decision rules and each leaf node represents the outcome. image

Accuracy Using Decision Tree

image

Feature Importance Score

  • Feature importance refers to techniques that assign a score to input features based on how useful they are at predicting a target variable. image

Accuracy Using Random Forest Classification

image

Error Rate Vs K Rate

  • The optimal value of k reduces effect of the noise on the classification, but makes boundaries between classes less distinc.
  • Elbow method helps data scientists to select the optimal number of clusters for KNN clustering. image

Accuracy Using K-Nearest Neighbors(KNN)

image

Clustering (No. of Clusters)

  • Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group and dissimilar to the data points in other groups.
  • It is basically a collection of objects on the basis of similarity and dissimilarity between them. image

K-Means ( Cluster)

  • K-Means Clustering is an Unsupervised Learning algorithm, which groups the unlabeled dataset into different clusters.
  • Here K defines the number of pre-defined clusters that need to be created in the process, as if K=2, there will be two clusters, and for K=3, there will be three clusters, and so o image

Accuracy Using Naive Bayes Algorithm

image

Accuracy Using Support Vector Machine (Linear Kernel)

image

Accuracy Using Support Vector Machine (Poly Kernel)

image

Accuracy Using Support Vector Machine (Gaussian Kernel)

image

Accuracy Using Support Vector Machine (Sigmoid Kernel)

image

Accuracy Using Artificial Neural Network (ANN)

image

About

Analyzed, Cleaned and Visualised dataset of different patients. Built a ML-Model using Python and Machine Learning that predicts whether a person is diabetic or not

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published