The prerequisites for this course are Calculus I, Data Structures, and Discrete Mathematics (this last is a co-req).
This course will introduce core principles of learning from data. More and more decisions are being made by algorithms that operate on large datasets, and this course will give students the tools to understand and contribute to this process. Throughout we will emphasize the ethical use of data and analyze case studies of how data science has intersected with society. This course will have a significant theory component, covering introductory linear algebra, probability, statistics, modeling, information theory, and optimization. However, we will also implement these ideas (in Python) and apply them to concrete datasets from a variety of fields (including images, video, text, DNA, music, art, etc).
The language for this course is Python 3.
See the Schedule for each week's reading assignment.
The schedule is tentative and subject to change throughout the semester.
1 | Sep 03 | Introduction to Data Science and Python
Reading:
| Tue: |
Lab 1: Computing and plotting in Python (due Sep 10) |
Sep 04 | Wed: | |||
2 | Sep 09 | Introduction to Modeling
Reading:
| Mon: |
Lab 2: Modeling climate change (due Sep 16) |
Sep 11 | Wed: | |||
3 | Sep 16 | Applied Linear Algebra and Optimization
Reading:
| Mon: |
Lab 3: Gradient Descent (due Sep 23) Last day to drop (Sep 20) |
Sep 18 | Wed: | |||
4 | Sep 23 | Evaluation Metrics
Reading:
| Mon: |
Lab 4: Evaluation Metrics (due Sep 30) |
Sep 25 | Wed: | |||
5 | Sep 30 | Probabilistic modeling I (+ review)
Reading:
| Mon: |
Midterm 1 |
Oct 02 | Tue+Wed: | |||
6 | Oct 7 | Ethics: Disparate Impact
Reading:
| Mon: |
Lab 5: Naive Bayes (due Oct 22) |
Oct 9 | Wed: | |||
Oct 14 | Fall Break | |||
Oct 16 | ||||
7 | Oct 21 | Information theory
Reading:
| Mon: |
Lab 6: Information Theory (due Oct 28) |
Oct 23 | Wed: | |||
8 | Oct 28 | Probabilistic modeling II + Visualization
Reading:
| Mon: |
Lab 7: Logistic Regression and Visualization (due Nov 4) Final Project proposal (due Nov 8) |
Oct 30 | Wed: | |||
9 | Nov 04 | Introduction to statistics
Reading:
| Mon: | |
Nov 06 | Wed: | |||
10 | Nov 11 | Introduction to statistics II (+ review)
Reading: | Tue: |
Lab 8: Statistics and Visualization (due Nov 18) |
Nov 13 | Wed: | |||
11 | Nov 18 | Midterm II review
Reading: | Mon: |
Midterm 2 |
Nov 20 | Wed: | |||
12 | Nov 25 | Unsupervised learning
Reading:
| Mon: | |
Nov 27 | ||||
13 | Dec 02 | Intro to neural networks
Reading:
| Mon: |
Final Project Presentation and Deliverables (GitHub repos due 12pm Dec 20) |
Dec 04 | Wed: | |||
14 | Dec 09 | Project Presentations
| Last day to pass/fail (Dec 13) | |
Dec 11 |