Most of the lecture and practice session material can be found from this GitHub repository: https://github.com/KristoR/data_engineering_2024
The course schedule for 2024:
Week | Lecture Date | Lecture Topic | Practice Date | Practice Topic |
---|---|---|---|---|
1 | 2024-09-02 | No class | 2024-09-05 | No class |
2 | 2024-09-09 | Intro | 2024-09-12 | Docker+Postgres |
3 | 2024-09-16 | Data processing and orchestration | 2024-09-19 | Airflow |
4 | 2024-09-23 | Data modelling | 2024-09-26 | Kimball |
5 | 2024-09-30 | Data transformation | 2024-10-03 | dbt |
6 | 2024-10-07 | Data storage | 2024-10-10 | DuckDB |
7 | 2024-10-14 | NoSQL (pre-recorded) | 2024-10-17 | MongoDB (online) |
8 | 2024-10-21 | Data Lakes | 2024-10-24 | Delta, Iceberg |
9 | 2024-10-28 | Graph Databases (online) | 2024-10-31 | Neo4j (online) |
10 | 2024-11-04 | Security and privacy | 2024-11-07 | Security and privacy |
11 | 2024-11-11 | Data governance | 2024-11-14 | Open Metadata (pre-recorded) |
12 | 2024-11-18 | Key-Value stores (online) | 2024-11-21 | Redis |
13 | 2024-11-25 | Data visualization | 2024-11-28 | Streamlit |
14 | 2024-12-02 | Exam | 2024-12-05 | Working in class (tutoring for project) |
15 | 2024-12-09 | Working in class (tutoring for project) | 2024-12-12 | Working in class (tutoring for project) |
16 | 2024-12-16 | Project presentation (poster session) | 2024-12-19 | Redo exam |