This repository contains course material for the LDSA Spark module. The most important resources in this repo are:
/data: this directory contains datasets for the lecture examples and for the assignmentLab Assignment.ipynb: lab assignment for the LDSA Spark moduleLecture Examples.ipynb: some Spark examples from the lecture