Inputs: Datasets from data folder
Outputs: allData.csv, data_NAandSteamOmitted.csv
Goal: take heat vulnerability-relevant datasets and create a .csv file with relevant variables across time and ZIP code.
Goal: prepare tree counts and 311 calls datasets for use in aggregateData.R
Inputs: data_NAandSteamOmitted.csv
Outputs: Decision tree, random forest, and XGBoost models for temporal HVI
Goal: Use machine learning to create baseline models with three different methods to predict HVI temporally for each ZIP code. Ground-truth these models against the given HVI from the NYC government and see if HVI changes temporally.
Necessary dependencies for .ipynb files
Stores .csv files for all used datasets