Schedule

This schedule is tentative and will be updated as the course progresses.

  • Event
    Date
    Description
    Course Material
  • Lecture
    04/09/2024 (Wed)
    1: Course Admin; Era of Big Data Analytics

    Readings: Ch1 of [Blum]

    Additional references: [DataCenter]

  • Lecture
    11/09/2024 (Wed)
    2: Computing as a Utility; Data-center Architecture

    Readings: Ch1 of [MMDS]

  • Due
    14/09/2024 (Sat) 23:59
    Homework 0 is due!
  • Holiday
    18/09/2024 (Wed)
    The day following the Chinese Mid-Autumn Festival
  • Lecture
    25/09/2024 (Wed)
    02/10/2024 (Wed)
    3 & 4: MapReduce/Hadoop; The Big Data Processing Stack

    Readings: Ch2.1-2.4 of [MMDS]; Ch2 of [JLin]; Ch3.1-3.4 of [JLin]

    Additional references: [CloudData]

  • Lecture
    09/10/2024 (Wed)
    5: Frequent Item-Set Mining and Association Rules

    Readings: Ch6.1-6.4 of [MMDS]

  • On Leave
    16/10/2024 (Wed)
    The instructor will be on conference leave; there is no lecture this week. The make-up lecture will be held on 04/12/2024 (Wed); the venue is to be decided.
  • Lecture
    23/10/2024 (Wed)
    6: Finding Similar Items and LSH

    Readings: Ch3.1-3.5 of [MMDS]

    Additional references: [ZG]

  • Lecture
    30/10/2024 (Wed)
    7: Clustering and GMM

    Readings: Ch7.1-7.4 of [MMDS]; Ch11 of [MMDS]; Ch9 of [CBishop]; [MLE/MAP]

  • Lecture
    06/11/2024 (Wed)
    8: Dimension Reduction

    Readings: Ch11 of [MMDS]

    Additional references: [PCA] [GuruswamiKannan]

  • Lecture
    13/11/2024 (Wed)
    9: Recommendation Systems
  • Lecture
    20/11/2024 (Wed)
    10: Regression and Gradient Descent; Recommendation Systems (cont'd)

    Readings: Ch9 of [MMDS]

    Additional references: [Netflix09] [KorenTalk] [ANg] [Pedregosa18] [Sra18]

  • Lecture
    27/11/2024 (Wed)
    11: Data Stream Algorithms

    Readings: Ch4.1-4.5 of [MMDS]

  • Make-up Lecture
    04/12/2024 (Wed)
    12: Data Stream Algorithms (cont'd) / Overflow

    Readings: Ch0, Ch1, Ch4.4, Ch6 of [ChakDataStream]