Course Schedule

Part 1: Resources and Deployment

Week 1

Mon, Sep 4
Labor Day
Wed, Sep 6
Course Intro
Read: Syllabus
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Fri, Sep 8
Deployment (Linux Shell)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Anki Flashcards: Deck

Week 2

Wed, Sep 13
Deployment (Docker)
Released: P1 (Docker)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Anki Flashcards: Deck
Quiz: week 1

Week 3

Fri, Sep 22
Compute Resources (Locks)
Read: Mastering Concurrency in Python ("Working With Threads In Python" chapter)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Anki Flashcards: Deck

Week 4

Mon, Sep 25
Memory Resources (Caching)
Read: Systems Performance, Gregg (6.2.2; "CPU Caches" and "Latency" subsections of 6.4.1)
Watch: Lecture
Slides: PDF
Worksheet: PDF
Anki Flashcards: Deck
Wed, Sep 27
Memory Resources (Caching Practice)
Watch: Lecture
Quiz: week 3 and before (cumulative)
Fri, Sep 29
Memory Resources (PyArrow)
Read: Gallery of Processor Cache Effects (Examples 1 and 2)
Due: P2
Released: P3 (Threads+Caching+gRPC, Model Serving)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 5

Mon, Oct 2
Network Resources (Overview)
Read: Designing Data Intensive Applications, Kleppmann (Chapter 4, "Encoding and Evolution")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Oct 4
Network Resources (gRPC+Compose)
Read: gRPC Basics Tutorial
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Quiz: week 4 and before (cumulative)
Fri, Oct 6
Storage Resources (File Systems)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 6

Fri, Oct 13
HDFS Overview
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Part 2: Clusters and Hadoop Ecosystem

Week 7

Fri, Oct 20
Spark RDDs
Watch: Lecture

Week 8

Mon, Oct 23
Midterm (in class)
Anki Flashcards: Deck

Week 9

Mon, Oct 30
Spark Internals and Performance
Read: Learning Spark, 2nd edition by Damji et al. (Chapter 7, "Optimizing and Tuning Spark Applications")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Fri, Nov 3
Wide Tables: HBase and Cassandra
Read: Cassandra, The Definitive Guide, by Carpenter et al. (Chapter 4, "The Cassandra Query Language")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 10

Mon, Nov 6
Cassandra Query Language (CQL)
Watch: Lecture
Anki Flashcards: Deck
Fri, Nov 10
Cassandra Replication
Due: P5
Released: P6 (Cassandra, Weather)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 11

Wed, Nov 15
Streaming: Kafka Demos
Watch: Lecture
Anki Flashcards: Deck
Quiz: week 10 and before (cumulative)
Fri, Nov 17
Streaming: Kafka Reliability
Read: Kafka, The Definitive Guide, 2nd edition by Shapira et al. (Chapter 7, "Reliable Data Delivery")
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck

Week 12

Mon, Nov 20
Streaming: Spark Programming
Read: Learning Spark, 2nd edition by Damji et al. (Chapter 8, "Structured Streaming")
Watch: Lecture
Anki Flashcards: Deck
Wed, Nov 22
Streaming: Spark Concepts
Due: P6
Released: P7 (Kafka, Weather Stations)
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Quiz: week 11 and before (cumulative)
Fri, Nov 24
Thanksgiving Break
Part 3: The Cloud

Week 13

Mon, Nov 27
The Cloud
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Nov 29
Big Query 1
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Quiz: week 12 and before (cumulative)

Week 14

Mon, Dec 4
Cancelled
Please use the extra time to work on P7.
Fri, Dec 8
Big Query 4
Watch: Lecture
Anki Flashcards: Deck

Week 15

Mon, Dec 11
Cloud Deployment
Watch: Lecture
Slides: PDF
Anki Flashcards: Deck
Wed, Dec 13
Review
Due: P8
Morning Review Session
Afternoon Review Session
Anki Flashcards: Deck