- Analyzed, Cleaned and Visualised dataset of different patients.
- Built a ML-Model using Python and Machine Learning that predicts whether a person is diabetic or not
- A Confusion matrix is an N x N matrix used for evaluating the performance of a classification model, where N is the number of target classes.
- The matrix compares the actual target values with those predicted by the machine learning model.
- Decision Tree is a Supervised learning technique that can be used for both classification and Regression problems, but mostly it is preferred for solving Classification problems.
- It is a tree-structured classifier, where internal nodes represent the features of a dataset, branches represent the decision rules and each leaf node represents the outcome.
- Feature importance refers to techniques that assign a score to input features based on how useful they are at predicting a target variable.
- The optimal value of k reduces effect of the noise on the classification, but makes boundaries between classes less distinc.
- Elbow method helps data scientists to select the optimal number of clusters for KNN clustering.
- Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group and dissimilar to the data points in other groups.
- It is basically a collection of objects on the basis of similarity and dissimilarity between them.