Skip to content

nimaij/SYD_DAT_6

 
 

Repository files navigation

Data Science Part Time Course - SYD DAT 6

ga-logo

Instructor: Alasdair Douglas

Teaching Assistant: Louis Tsang

Location: Level M, 56-58 York St Sydney NSW 2000

ga-location

Dates: 10/10/2016 - 14/12/2016

Time: 6:00 p.m. - 9:00 p.m., Monday and Wednesday evenings

Schedule

week Monday Wednesday
1 10/10: Introduction 12/10: Basics of Data Science with Python and Git
2 17/10: Data Visualisation 19/10: Linear Regression
3 24/10: Logistic Regression 26/10: Model Evaluation
4 31/10: Regularisation & Dimensionality Reduction 02/11: Clustering
5 07/11: Decision Trees 09/11: Random Forest & Ensembling
6 16/11: Recommendation Engines 21/11: Cloud Computing, Big Data and Spark
7 23/11: Natural Language Processing 26/10: Graphs & Network Analysis
8 28/11: Time Series 30/11: Causality
9 05/12: Communication 07/12: Neural Networks & Deep Learning
10 12/12: Course Review & Project Presentations 14/12: Project Presentations

Pre-Work

Installation and Setup

  • Install the Anaconda distribution of Python 2.7x.
  • Install Git and create a GitHub account.
  • Once you receive an email invitation from Slack, join our "SYD_DAT_6 team" and add your photo!

Resources

Readings

Optional

You're also more than welcome to do the following if you're keen to get extra advanced for your first class:


Lesson 1: Introduction

  • [Slides](/slides/Data Science Week 1 - Monday 10 October - Intro.pdf)
  • Lab
  • Introduction to General Assembly and the Data Science Part Time course
  • Course overview: our philosophy and expectations
  • Agree on a way of working
  • Tools: check for proper setup of Git, Anaconda, overview of Slack

Homework:

  • Resolve any installation issues before next class.
  • Make sure you have a github profile and have forked this repo "SYD_DAT_6"
  • Clone the fork you created on your github profile to your computer
  • Complete the brief skill survey after Lesson 1 https://goo.gl/forms/sCot5Y4CfuO1Oacg2

Optional:


Lesson 2: Data science basics and Git

<<<<<<< HEAD

Class 2: Data science basics and Git

=======

upstream/master

  • [Slides](/slides/Data Science Week 1 - Wednesday 12 October.pdf)
  • Lab
  • What is data science
  • What does a Data Scientist need to succeed
  • How does a data science project flow
  • What is Git
  • Using Git bash
  • Using Git for version control and collaboration
  • Using the Pandas package for data manipulation in Python

Homework:

<<<<<<< HEAD

Class 2: Data science basics and Git

=======

Lesson 3: Data Visualisation

upstream/master

  • [Slides](/slides/Data Science Week 2 - Monday 17 October - Data Visualisation.pdf)
  • Lab
  • What is data visualisation
  • Why we visualise data
  • How do we visualise data
  • Philosophies on visualising data
  • Git sync with upstream, make changes, push and make pull request
  • Git lab
  • Visualisation Lab

Homework:

  • Homework1.ipynb which is located in the homework folder. Due Friday the 21st of October
  • Reading chapter 3 of an Introduction to Statistical Learning on Linear Regression <<<<<<< HEAD =======

Lesson 4: Linear Regression

  • Slides
  • Lab
  • Understand the differences between supervised and unsupervised learning
  • Describe the process of building a linear regression model
  • Build a linear regression model and interpret the output

Homework:


upstream/master

About

Sydney Data Science with General Assembly

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.1%
  • CSS 1.0%
  • HTML 0.4%
  • Python 0.3%
  • R 0.1%
  • Ruby 0.1%