LSDE: Large Scale Data Engineering 2022
Lecture Schedule
Lecture Topic 1

Summary: Cloud Computing, Assignment 1

Lecture Topic 2

Summary: Spark

  • 2022/09/12@23:59
    • deadline assignment 1a (submission is the last GitHub commit before that time)
  • 2022/09/13
  • 2022/09/16
Lecture Topic 3

Summary: SQL on Big Data

  • 2022/09/19@23:59
    • deadline assignment 1b (submission is the last GitHub commit before that time)
    • practicum groups of 3 students each will be formed (self-enrollment via Canvas)
  • 2022/09/20
    • discussion of assignment 1b
  • 2022/09/23

Assignment 2: Big Data Project

Summary: Assignment 2

  • 2022/09/26@23:59
    • deadline assignment 1c (submission is the last GitHub commit before that time)
  • 2022/09/27
    • Assignment 2 topics will be chosen first come, first served (FCFS), in order of assignment 1c leaderboard ranking
  • 2022/10/04
    • finish SQL on Big Data + tips on using Databricks
  • 2022/10/04@23:59
    • deadline 1 for assignment 2a (submit draft project plans via Canvas)

Lecture Topic 4

Summary: Cloud Database Systems

  • 2022/10/07
  • 2022/10/07@23:59
    • deadline 2 for assignment 2a (submit peer reviews via Canvas)
Lecture Topic 5

    Summary: Scalable Machine Learning

    Bonus Topic 6

    Summary: Transactional Databases at Large Scale

    Project Question Sessions (10 minutes per group)

    Group meetings take place in the LSDE Zoom channel (same as lectures).

    All group members must be present. You may prepare a few (max 4) slides with talking points, or results/problems/questions.

    • One-on-one group meetings: 2022/10/13 (Thursday) and 2022/10/14 (Friday)
    • One-on-one group meetings: 2022/10/20 (Thursday) and 2022/10/21 (Friday)

    Project Presentations

    The goal of these 8-minute presentations is to share the results of your project with all students. You can speak for only 5 minutes (hard cut-off), so please concentrate on the highlights. The remaining 3 minutes are for questions and the presenter change. The order of group presentations will be determined just in time. The presentations will be graded; all group members must be present and must participate in answering questions.

    The presentations must be in plain PDF and must be submitted to Canvas the night before. There will be a presentation laptop (with Chrome, Safari, and internet access). You cannot use your own laptop; this is to streamline the presenter changes.

    This is a very long session that also includes a short guest lecture. I therefore recommend attending only your half: 13:25-15:50 if your group is even-numbered, otherwise 15:10-17:35.

    • 2022/10/27@23:59
      • deadline assignment 2b (send in presentation via Canvas)
    • 2022/10/28 room WN-M143, 13:30-17:30 CEST
      • 13:30-15:05 Final Project Presentations: even-numbered groups
      • 15:05-15:15 break
      • 15:15-15:45 guest presentation by Bart Samwel (Databricks Amsterdam)
      • 15:45-15:55 break
      • 15:55-17:30 Final Project Presentations: odd-numbered groups