davidit33/SCADA-ML

Divergent Results in Replicating 'SCADA System Testbed for Cybersecurity Research Using Machine Learning Approach'

As a student, I attempted to replicate the experiment from the 2018 article "SCADA System Testbed for Cybersecurity Research Using Machine Learning Approach." My results for Decision Tree, Random Forest, and Logistic Regression were similar to those the researchers reported, but the accuracy I obtained for Naive Bayes was significantly lower, which is why Naive Bayes is not included in my results.

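Additionally, I found it puzzling that the KNN algorithm was applied to the dataset without feature scaling. KNN classifies by distance, so features with larger numeric ranges can dominate the result when the data is left unscaled.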
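To illustrate the scaling concern, here is a minimal sketch using only the Python standard library. It is not the code from the notebook or the article: the two features (`packet_rate`, `payload_bytes`), the four training rows, and the query point are made-up toy values, and it uses a 1-nearest-neighbor rule rather than whatever `k` the study used. It only shows how a prediction can flip once features are put on a comparable scale.

```python
import math
from statistics import mean, pstdev


def fit_scaler(rows):
    """Return per-column (mean, std) computed from the training rows."""
    cols = list(zip(*rows))
    # Fall back to 1.0 for a constant column to avoid dividing by zero.
    return [(mean(c), pstdev(c) or 1.0) for c in cols]


def transform(row, params):
    """Apply z-score scaling to one row using the fitted parameters."""
    return tuple((v - mu) / sd for v, (mu, sd) in zip(row, params))


def nearest_label(train, labels, query):
    """Predict the label of the closest training row (Euclidean distance)."""
    dists = [math.dist(row, query) for row in train]
    return labels[dists.index(min(dists))]


# Hypothetical toy data: (packet_rate, payload_bytes). In this toy set the
# class depends on packet_rate, while payload_bytes is large-valued noise.
X_train = [(0.1, 1000.0), (0.2, 3000.0), (0.9, 1200.0), (0.8, 2800.0)]
y_train = ["normal", "normal", "attack", "attack"]
query = (0.15, 1250.0)

# Without scaling, payload_bytes dominates the distance.
raw_pred = nearest_label(X_train, y_train, query)

# With scaling, both features contribute comparably.
params = fit_scaler(X_train)
X_scaled = [transform(row, params) for row in X_train]
scaled_pred = nearest_label(X_scaled, y_train, transform(query, params))

print(raw_pred, scaled_pred)  # prints: attack normal
```

With scikit-learn, the usual way to get the same effect would be to place a `StandardScaler` before `KNeighborsClassifier` in a `Pipeline`, so that the scaling parameters are fitted on the training split only.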

The article acknowledged that the dataset was unbalanced, yet the researchers still used accuracy as the primary performance metric. In my replication I used "f1_score" instead to account for the imbalance. It was unclear to me whether the evaluation metric used in Figure 8 took the imbalance in the dataset into account.
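The gap between accuracy and F1 on unbalanced data can be seen with a small worked example. This is a sketch under stated assumptions, not the evaluation from the notebook or the article: the label counts (95 normal, 5 attack) are invented, label `1` is assumed to be the attack (positive) class, and the helper `accuracy_and_f1` is a hypothetical name written here for illustration.

```python
def accuracy_and_f1(y_true, y_pred, positive=1):
    """Compute accuracy and binary F1 from two equal-length label lists."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs)
    # F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when there are no
    # true positives, false positives, or false negatives at all.
    denom = 2 * tp + fp + fn
    f1 = (2 * tp / denom) if denom else 0.0
    return accuracy, f1


# Invented, unbalanced labels: 95 normal (0) and 5 attack (1) samples.
y_true = [0] * 95 + [1] * 5

# A model that always predicts "normal" looks good on accuracy alone.
always_normal = [0] * 100
acc_majority, f1_majority = accuracy_and_f1(y_true, always_normal)

# A model that detects only one of the five attacks.
one_hit = [0] * 95 + [1] + [0] * 4
acc_one_hit, f1_one_hit = accuracy_and_f1(y_true, one_hit)

print(acc_majority, f1_majority)  # prints: 0.95 0.0
print(acc_one_hit, round(f1_one_hit, 3))  # prints: 0.96 0.333
```

Both toy models score at least 0.95 on accuracy while missing most or all attacks, which is the behavior F1 exposes. In scikit-learn the equivalent call is `f1_score` from `sklearn.metrics`; its `average` parameter controls how per-class scores are combined for multi-class data.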

👨‍💻 You can see the notebook here: Kaggle
