Schedule
This is a tentative schedule. It will be updated according to the actual progress.
-
Week/DateDescriptionCourse MaterialRemark
-
Week 1&2 (@WMY 407)
10/01/2026 (Sat)
17/01/2026 (Sat)
Resource Management and Infrastructure for Big Data Systems and Cloud-Native Applications; An Introduction to ZooKeeper (ZK)Readings: [YARN]; [Mesos]; Ch.2-3 of [Hadoop]; [CloudData]; [Kubernetes1];
Additional references: [Borg]; [Omega]; [Sparrow]; [Apollo]; [Mercury]; [MapReduceFamilySurvey2013]; [Kubernetes2];
-
24/01/2026 (Sat)
Lecture cancelledInstructor on Conference Leave -
Week 4 (@WMY 407)
31/01/2026 (Sat)
DAG-based Dataflow Systems: Dryad, DryadLINQ, Tez and Beyond[slides] -
Week 5
02/02/2026 (Mon)
High-level Big Data Query Languages: Pig and Hive[slides]Readings: [PigLatin]; [Hive1]; Ch.16-17 of [Hadoop];
Additional references: [Hive2]; [Hive3]; [HiveAdvances]; [Pig]; [Hive];
-
Week 6
09/02/2026 (Mon)
BDAS and Spark[slides]Readings: [Spark2018]
Additional references: [SparkScaling] [MapReduceVsSpark] Ch.1, Ch.10 of [LearnSpark] Appendix A of [SparkAnalytics]
-
10/02/2026 23:59PM
TuesdayAssignment #1 - Hadoop over Kubernetes is due!Due -
16/02/2026 (Mon)
Lunar New Year VacationHoliday -
Week 7
23/02/2026 (Mon)
BDAS and Spark(cont'd)Readings: [Spark2018]
Additional references: [SparkScaling] [MapReduceVsSpark] Ch.1, Ch.10 of [LearnSpark] Appendix A of [SparkAnalytics]
-
01/03/2026 23:59PM
SundayAssignment #2 - Pig, Hive and SparkRDD is due!Due -
02/03/2026 (Mon)
Reading WeekHoliday -
Week 8
09/03/2026 (Mon)
Spark SQL[slides]Readings: [SparkSQL] [LearnSpark2]
Additional references: [SharkSQL] [SparkMBase] Ch.3-6 of [LearnSpark2ndEd]
-
Week 9&10
16/03/2026 (Mon)
23/03/2026 (Mon)
Big Stream Processing frameworks: Unified Log via Apache Kafka; Storm ; Spark Streaming ; Spark Structural Streaming ; Lambda & Kappa Architecture;Readings: [Storm@Twitter]; [Heron]; [SparkStreaming]; Ch.8 of [LearnSpark2ndEd];
Additional references: [KafkaBook]; [KleppmannMSSS]; [StormApplied];
-
Week 11
30/03/2026 (Mon)
Big Graph Processing frameworks: Pregel/Giraph and GraphLab ; GraphX, GraphFrame;[slides]Readings: [GraphLab1]; [PowerGraph]; [GraphX];
Additional references: [GraphChi];
-
06/04/2026 (Mon)
The day following Ching Ming FestivalHoliday -
07/04/2026 23:59PM
TuesdayAssignment #3 - SparkSQL, Kafka, and Streaming is due!Due -
Week 12
13/04/2026 (Mon)
Big Data Stores (aka NoSQL Databases)[slides]Readings: [Dynamo] [BigTable] [RealtimeHadoopFacebook] Ch.20 of [Hadoop] [Cassandra]
Additional references: [HBase] [CassandraBook]
-
Week 13(@SHB 801)
20/04/2026 (Mon)
Spark Machine Learning Support and Beyond (time-permitting)Readings: [SparkMLlib] Ch.11 of [LearnSpark] Ch.9 of [LearnSpark2ndEd]
Make-up lecture -
27/04/2026 23:59PM
MondayAssignment #4 - GraphFrames, HBase is due!Due -
04/05/2026 17:30
MondayFinal Exam (@ New Asia College Gymnasium)Exam
