Course Schedule

Part 1: Resources and Deployment

Week 1

Mon, Jan 23
No Class
Wed, Jan 25
Course Intro
Read: Syllabus
Watch: Lecture
Slides: PDF
Fri, Jan 27
Deployment (Linux Shell)
Watch: Lecture
Slides: PDF
Worksheet: PDF

Week 2

Wed, Feb 1
Memory Resources (Caching)
Read: Systems Performance, Gregg (6.2.2; "CPU Caches" and "Latency" subsections of 6.4.1)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Quiz: week 1

Week 3

Mon, Feb 6
Compute Resources (PyTorch Optimization)
Watch: Lecture

Week 4

Mon, Feb 13
Compute Resources (Locks)
Read: Mastering Concurrency in Python ("Working With Threads In Python" chapter)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Fri, Feb 17
Deployment (Docker)
Watch: Lecture
Slides: PDF
Worksheet: PDF

Week 5

Mon, Feb 20
Docker/Catch-up
Watch: Lecture
Wed, Feb 22
Storage Resources (File Systems)
Watch: Lecture
Slides: PDF
Quiz: week 4 and before (cumulative)

Part 2: Clusters and Hadoop Ecosystem

Week 6

Wed, Mar 1
Hadoop Ecosystem
Due: P2
Released: P3 (HDFS, Loans)
Watch: Lecture
Slides: PDF
Quiz: week 5 and before (cumulative)

Week 7

Fri, Mar 10
Review
Released: P4 (Spark, Loans)
Watch: Lecture

Week 8

Mon, Mar 13
Spring Break
Wed, Mar 15
Spring Break
Fri, Mar 17
Spring Break

Week 9

Mon, Mar 20
Midterm (in class)
Due: P3

Week 10

Mon, Mar 27
Spark Internals and Performance 2
Watch: Lecture
Fri, Mar 31
Cassandra Query Language (CQL)
Watch: Lecture

Week 11

Wed, Apr 5
Cassandra Quorums
Watch: Lecture
Quiz: week 10 and before (cumulative)

Week 12

Wed, Apr 12
Streaming: Kafka Demos
Watch: Lecture
Quiz: week 11 and before (cumulative)

Week 13

Wed, Apr 19
Streaming: Spark Concepts
Watch: Lecture
Slides: PDF
Quiz: week 12 and before (cumulative)

Part 3: The Cloud

Week 14

Mon, Apr 24
The Cloud
Watch: Lecture
Slides: PDF
Wed, Apr 26
BigQuery 1
Released: P7 (BigQuery, Optional)
Watch: Lecture
Quiz: week 13 and before (cumulative)

Week 15

Wed, May 3
BigQuery 4 (and Cloud Deployment)
Watch: Lecture
Slides: PDF
Fri, May 5
Review
Due: P7
Watch: Lecture