To change the LSTM into using numbers and not the discretization #184

Open
AlyaGomaa opened this issue Feb 10, 2023 · 1 comment · May be fixed by #447
Labels
Enhancement Machine Learning Needs knowledge of Machine Learning

Comments

@AlyaGomaa
Collaborator

Created by Alya Gomaa via monday.com integration. 🎉

@eldraco eldraco added the Machine Learning Needs knowledge of Machine Learning label Feb 24, 2023
@tahifahimi tahifahimi linked a pull request Feb 6, 2024 that will close this issue
7 tasks
@tahifahimi

tahifahimi commented Feb 28, 2024

Below is a performance evaluation of four machine learning models trained on the dataset "modules/rnn_cc_detection/datasets/dataset_more_labels.dat": Random Forest, Support Vector Machine (SVM), k-Nearest Neighbors (KNN), and a Recurrent Neural Network (RNN). Each record in the dataset carries a binary class label.
#316 proposes one-hot encoding; the evaluation below instead maps each StratoLetter to an integer.
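
To make the difference between the two encodings concrete, here is a minimal sketch of integer mapping next to one-hot expansion. The sample sequences, the helper names (`build_vocab`, `encode`, `one_hot`), and the choice of 0 as the padding id are all assumptions for illustration; the real StratoLetter alphabet and the mapping used in the commit are defined in the Slips code base, not here.

```python
def build_vocab(sequences):
    """Map every distinct symbol to a positive integer; 0 is reserved for padding."""
    symbols = sorted({ch for seq in sequences for ch in seq})
    return {ch: idx for idx, ch in enumerate(symbols, start=1)}


def encode(seq, vocab, max_len):
    """Integer-encode one sequence, truncating or right-padding with 0 to max_len.

    Symbols missing from the vocabulary also map to 0 in this sketch.
    """
    ids = [vocab.get(ch, 0) for ch in seq[:max_len]]
    return ids + [0] * (max_len - len(ids))


def one_hot(ids, vocab_size):
    """Expand integer ids into one-hot rows; column 0 is the padding slot."""
    return [[1 if col == idx else 0 for col in range(vocab_size + 1)] for idx in ids]


# Made-up letter sequences, not taken from the dataset.
sequences = ["88*y*y", "a,a,b."]
vocab = build_vocab(sequences)
encoded = [encode(seq, vocab, max_len=8) for seq in sequences]
expanded = one_hot(encoded[0], len(vocab))

print(encoded[0])        # one integer per letter
print(len(expanded[0]))  # one-hot width: vocabulary size + 1
```

With integer mapping, each sequence stays a single vector of length `max_len`; one-hot encoding turns it into a `max_len` by `vocab_size + 1` matrix. Integer ids also imply an ordering between letters that one-hot encoding does not, which may matter for distance-based models such as KNN and SVM.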

Model Overview:

  1. Random Forest:
    • Accuracy: 1
    • F1 Score: 1
    • Methodology: Trained with 100 decision trees.
  2. Support Vector Machine (SVM):
    • Accuracy: 0.8461
    • F1 Score: 0.9166
    • Methodology: Radial basis function (RBF) kernel. The features were scaled using StandardScaler.
  3. k-Nearest Neighbors (KNN):
    • Accuracy: 0.7692
    • F1 Score: 0.8695
    • Methodology: 5 neighbors. The features were scaled using StandardScaler.
  4. Recurrent Neural Network (RNN):
    • Accuracy: 0.8461
    • Loss: 0.6770
    • Methodology: A Bidirectional GRU with dropout layers, trained for 10 epochs using the RMSprop optimizer. Accuracy was measured on the test dataset.
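
As a reading aid for the accuracy and F1 figures above, the sketch below computes both metrics from confusion-matrix counts. The counts are hypothetical: the comment reports only the final scores, and a 13-record test split is an assumption, chosen because it is one size that reproduces the reported numbers.

```python
def accuracy(tp, fp, fn, tn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + fn + tn)


def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall, written in terms of counts."""
    return 2 * tp / (2 * tp + fp + fn)


# Hypothetical confusion counts (NOT from the source) for a 13-record test split.
svm_like = {"tp": 11, "fp": 2, "fn": 0, "tn": 0}
knn_like = {"tp": 10, "fp": 2, "fn": 1, "tn": 0}

for name, c in (("svm_like", svm_like), ("knn_like", knn_like)):
    acc = accuracy(c["tp"], c["fp"], c["fn"], c["tn"])
    f1 = f1_score(c["tp"], c["fp"], c["fn"])
    print(f"{name}: accuracy={acc:.4f} f1={f1:.4f}")
# svm_like: accuracy=0.8462 f1=0.9167
# knn_like: accuracy=0.7692 f1=0.8696
```

These values match the reported 0.8461/0.9166 and 0.7692/0.8695 if the reported figures were truncated rather than rounded. Other counts and split sizes could produce the same scores, so this is not a reconstruction of the actual confusion matrices. If the split does resemble these counts, a classifier that always predicts the positive class would score the same as the SVM-like case, so reporting a majority-class baseline alongside the models may be worthwhile.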

Discussion:

  • Random Forest achieved the highest accuracy of the four models evaluated, at 100%.
  • The dataset has only 62 records, so the test split is small and every score is sensitive to a handful of predictions. We expect accuracy to improve as more records are added.
  • All models were trained and tested on the same dataset split, ensuring a fair comparison of their performance metrics.
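
The shared-split comparison described above could look roughly like the following scikit-learn sketch. The data is synthetic (random integers standing in for integer-encoded letters), and the 80/20 stratified split, the seed, the sequence length of 20, and the class balance are assumptions; the actual values are in the linked commit. The GRU model is left out to keep the example dependency-light.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the 62-record dataset (assumed shape and class balance).
rng = np.random.default_rng(0)
X = rng.integers(0, 50, size=(62, 20))
y = np.array([1] * 50 + [0] * 12)

# One split, reused by every model, so the metrics are directly comparable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

models = {
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=42),
    # Scaling lives inside the pipeline so it is fitted on the training data only.
    "svm_rbf": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    "knn_5": make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5)),
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    results[name] = (
        accuracy_score(y_test, pred),
        f1_score(y_test, pred, zero_division=0),
    )

for name, (acc, f1) in results.items():
    print(f"{name}: accuracy={acc:.4f} f1={f1:.4f}")
```

With so few records, a single split gives noisy estimates; repeating the comparison with stratified k-fold cross-validation would be one way to check whether the ranking of the models is stable.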

Details are available at 7fbc2ce

@AlyaGomaa AlyaGomaa added this to Slips Jul 12, 2024
@github-project-automation github-project-automation bot moved this to Todo in Slips Jul 12, 2024
@eldraco eldraco added this to the Fix the ML models milestone Jul 27, 2024
@eldraco eldraco self-assigned this Jan 14, 2025
Projects
Status: Todo

3 participants