https://sheldonsebastian.github.io/GW-Data-Science-Datathon/
Path | Description |
---|---|
common | contains utility functions to clean data, perform evaluation, modelling, etc. |
images | contains all saved images |
input_data | contains the input data, cleaned train-test-validation data and feature-target NumPy arrays |
model_trainer | contains model training and feature importance notebooks |
0_preprocessing_eda.ipynb | contains cleaning, preprocessing, stratified split and feature-target separator code |
1_final_report.ipynb | FINAL REPORT |
- Download data from here.
- Run 0_preprocessing_eda.ipynb to preprocess data and create stratified train-test-holdout splits.
- Run all notebooks in the model_trainer folder to train the respective models.
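
For readers unfamiliar with the stratified train-test-holdout split mentioned in the steps above, here is a minimal standard-library sketch of the idea. It is not the code from 0_preprocessing_eda.ipynb: the function name `stratified_split`, the 60/20/20 fractions, the seed, and the toy labels are all assumptions made for illustration. In practice the same effect is usually obtained with scikit-learn's `train_test_split` and its `stratify` argument.

```python
import random
from collections import defaultdict


def stratified_split(labels, fractions=(0.6, 0.2, 0.2), seed=42):
    """Return (train, test, holdout) index lists that preserve class ratios.

    Hypothetical helper: the fractions and seed are illustrative defaults,
    not values taken from the repository.
    """
    rng = random.Random(seed)

    # Group row indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train, test, holdout = [], [], []
    for indices in by_class.values():
        # Shuffle within each class, then cut the class into three parts
        # so every split receives the same share of that class.
        rng.shuffle(indices)
        n_train = round(len(indices) * fractions[0])
        n_test = round(len(indices) * fractions[1])
        train.extend(indices[:n_train])
        test.extend(indices[n_train:n_train + n_test])
        holdout.extend(indices[n_train + n_test:])
    return sorted(train), sorted(test), sorted(holdout)


# Toy imbalanced target: 80 negatives and 20 positives (made-up data).
labels = [0] * 80 + [1] * 20
train_idx, test_idx, holdout_idx = stratified_split(labels)
```

Because the cut is made inside each class, every split keeps the 20% positive rate of the toy target, which is the property a stratified split is meant to guarantee on imbalanced data.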