This schedule is tentative and will be updated to reflect actual progress.
Course Admin; Era of Big Data; Data-center Architecture
Saturday: Assignment #0 - Hadoop Cluster Setup is due!
Chinese Lunar New Year
Programming Models for Big Data Computing: MapReduce/Hadoop, GFS/HDFS
Resource Management Platforms for Big Data Processing Systems
High-level Big Data Query Languages: Pig and Hive
Tuesday: Assignment #1 - Community Detection is due!
BDAS and Spark
Big Stream Processing Frameworks: Unified Log via Apache Kafka; Storm; Spark Streaming; Spark Structured Streaming; Lambda & Kappa Architecture
Monday: Assignment #2 - Pig, Hive and SparkRDD is due!
Big Graph Processing Frameworks: Pregel/Giraph and GraphLab; GraphX, GraphFrames
Ching Ming Festival
Wednesday: Assignment #3 - Kafka is due!
Big Data Stores (aka NoSQL Databases)
Spark Machine Learning Support and Beyond (time-permitting)
Thursday: Assignment #4 - GraphFrames, GraphX, HBase is due!
Friday: Project is due!