- Use only NumPy; not scikit-learn - Use either randomly generated dataset or simple dataset from kaggle - Pls include EDA section in your notebook for the chosen dataset - Comment on inferences and metrics for evaluation