SonarProject

How to download the repository:

git clone https://github.com/anmancuso/SonarProject

Required Packages:

numpy
pandas
seaborn
matplotlib
sklearn

(The code has been developed with python3.8)

How to run the project:

The project is composed by three notebooks:

Exploring_the_dataset.ipynb -> Data Visualization
Rock_vs_Mine_binary_classification_without_feature_reduction.ipynb -> ML without any data pre-processing
Rock_vs_Mine_binary_classification_with_feature_reduction.ipynb -> ML with feature selection and data pre-processing This is the main script!

Then there are two additional .py scripts to plot and to print the results.

Description of the Project

The dataset under study is a famous dataset used by Gorman, R. P., and Sejnowski, T. J. (1988) in their “Analysis of Hidden Units in a Layered Network Trained to Classify Sonar Targets” in Neural Networks, Vol. 1, pp. 75–89. As the name suggests, the aim of this project is to use the data provided by a sonar to distinguish between rocks and metal objects such as mines. The dataset is made of 60 features (strenght of bouncing signals at 60 different angles) with 208 observations. For each observation we know the outcome as a Rock "R" or a Mine "M".

In the Exploring_the_dataset.ipynb notebook, the data are studied by visualizing the box plots to check the precesence of possible outliers and to understand the general trend of the data. Of course here the data under studied are only the train-data obtained with a train test split with size 0.2 (20% of the dataset is used to test the learning). From this check, no particular evidences can be pointed out.

In the first attempt to face the classification problem, the data have not been processed (except for the initial Encoding of the result of the observation : "R" is the class 1 and "M" is the class 0).

The algorithm used here for the train are:

I have decided to use different algorithms in order to compare the results in terms of Accuracy, Precision, AUC and Training Time. In particular from the Accuracy shown in the following figure:

one can conclude that even without Data Pre-processing the results of the prediction are acceptable (higher than 80%) with the SVC algorithm.

Of course by looking at the results obtained by facing the classification problem with feature selection and data preparation, one realize that the performaces considerably improve. The Data Preparation consist of the standardization of the observation (that is to have a dataset of Mean 0 and STD 1). For the feature selection instead, the algorithm used are:

After a first check of the distribution of the feature importance to optimize the number of feature to keep, I have implemented the train by means of Pipelines built with two steps:

Data Processing
Training fit

By doing so for all the Algorithm chosen and for all the Feature selection functions, I have compared all the results, which can be summarized in the following plots:

Among the Algorithm Tested, for sure the linear Support Vector Classifier with a Feature Reduction or with a Data Standardization, give the best results in terms of Accuracy and AUC.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.ipynb_checkpoints		.ipynb_checkpoints
plots		plots
Exploring_the_Dataset.ipynb		Exploring_the_Dataset.ipynb
README.md		README.md
Rock_vs_Mine_binary_classification_with_feature_reduction.ipynb		Rock_vs_Mine_binary_classification_with_feature_reduction.ipynb
Rock_vs_Mine_binary_classification_without_feature_reduction.ipynb		Rock_vs_Mine_binary_classification_without_feature_reduction.ipynb
my_plotting.py		my_plotting.py
my_printing.py		my_printing.py
results.csv		results.csv
sonar.csv		sonar.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SonarProject

How to download the repository:

Required Packages:

How to run the project:

Description of the Project

About

Releases

Packages

Languages

anmancuso/SonarProject

Folders and files

Latest commit

History

Repository files navigation

SonarProject

How to download the repository:

Required Packages:

How to run the project:

Description of the Project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages