Institute of Computer Science
Data engineering for Conversion Master's (LTAT.02.026), 2022/23 spring

Homework

HW3 (due 29.05)

Apache Spark DataFrames and SQL with Yelp Dataset

Analyze the Yelp dataset with Apache Spark DataFrames and Spark SQL, processing and manipulating the data in parallel. The tasks cover loading the Yelp tables as DataFrames, extracting user statistics, examining businesses, and generating pivot tables with the Spark DataFrame API and Spark SQL. Make sure you have a working Spark environment, and submit your Python scripts and their outputs as deliverables. BigDataLab
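As a rough illustration of the kind of operations involved, the sketch below computes per-user statistics with the DataFrame API and a pivot table over a Spark SQL join. It is not the assignment's solution: the toy rows, the column names (`user_id`, `business_id`, `stars`, `city`), and the helper names are assumptions made for the example, and a real script would load the Yelp JSON files with `spark.read.json(...)` instead of building DataFrames in memory.

```python
def build_session(app_name="yelp-sketch"):
    """Start a local Spark session (pyspark is imported lazily on purpose)."""
    from pyspark.sql import SparkSession
    return SparkSession.builder.master("local[1]").appName(app_name).getOrCreate()


def top_reviewers(reviews, n=2):
    """DataFrame API: review count and mean star rating per user."""
    from pyspark.sql import functions as F
    return (
        reviews.groupBy("user_id")
        .agg(F.count("*").alias("n_reviews"), F.avg("stars").alias("avg_stars"))
        .orderBy(F.desc("n_reviews"), "user_id")
        .limit(n)
    )


def stars_by_city(spark, reviews, businesses):
    """Spark SQL join, then a pivot: review counts per city and star rating."""
    reviews.createOrReplaceTempView("review")
    businesses.createOrReplaceTempView("business")
    joined = spark.sql(
        "SELECT b.city, r.stars "
        "FROM review r JOIN business b ON r.business_id = b.business_id"
    )
    # Listing the pivot values up front avoids an extra pass over the data.
    return joined.groupBy("city").pivot("stars", [1, 5]).count().na.fill(0).orderBy("city")


def run_demo():
    """Run both queries on a few hypothetical rows and return plain tuples."""
    spark = build_session()
    try:
        reviews = spark.createDataFrame(
            [("u1", "b1", 5), ("u1", "b2", 1), ("u2", "b1", 5), ("u1", "b1", 5)],
            ["user_id", "business_id", "stars"],
        )
        businesses = spark.createDataFrame(
            [("b1", "Phoenix"), ("b2", "Las Vegas")], ["business_id", "city"]
        )
        top = [tuple(r) for r in top_reviewers(reviews).collect()]
        pivot = [tuple(r) for r in stars_by_city(spark, reviews, businesses).collect()]
        return top, pivot
    finally:
        spark.stop()
```

Calling `run_demo()` requires a working Spark installation (pyspark plus a Java runtime). The pivot call names its values explicitly, which is a design choice for this toy data; on the full dataset you would list all star ratings or let Spark infer them.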

HW2 (due 10.04)

ETL Process for Air Quality Data

Perform an ETL (Extract, Transform, Load) process on air quality data from http://airviro.klab.ee/ and create tables with hourly, daily, and monthly average values for all columns in the dataset. Adhere to data management principles, maintain an organized file structure, and document the process in a README.md file. Publish the code on GitHub (private repositories are allowed). Further instructions
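The three ETL stages and the hourly, daily, and monthly averaging can be sketched with the Python standard library alone. This is a minimal sketch under stated assumptions, not the required implementation: the inline CSV sample, the `timestamp`, `PM10`, and `NO2` column names, the timestamp format, and the `air_quality_*` table names are all hypothetical stand-ins for whatever the real export contains.

```python
import csv
import io
import sqlite3
from collections import defaultdict
from datetime import datetime

# Each format truncates a timestamp to the start of its averaging period.
PERIODS = {"hourly": "%Y-%m-%d %H:00", "daily": "%Y-%m-%d", "monthly": "%Y-%m"}


def extract(text):
    """Extract: parse raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows, period, time_col="timestamp"):
    """Transform: average every measurement column per period, skipping blanks."""
    sums = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(lambda: defaultdict(int))
    for row in rows:
        stamp = datetime.strptime(row[time_col], "%Y-%m-%d %H:%M")
        key = stamp.strftime(PERIODS[period])
        for col, raw in row.items():
            if col == time_col or raw in ("", None):
                continue  # a missing measurement is left out of the mean
            sums[key][col] += float(raw)
            counts[key][col] += 1
    return {
        key: {col: sums[key][col] / counts[key][col] for col in sums[key]}
        for key in sorted(sums)
    }


def load(conn, table, averages, columns):
    """Load: write one row per period into a new SQLite table."""
    col_defs = ", ".join(f'"{c}" REAL' for c in columns)
    conn.execute(f'CREATE TABLE "{table}" (period TEXT PRIMARY KEY, {col_defs})')
    placeholders = ", ".join("?" for _ in range(len(columns) + 1))
    for period, means in averages.items():
        conn.execute(
            f'INSERT INTO "{table}" VALUES ({placeholders})',
            [period] + [means.get(c) for c in columns],
        )


# Hypothetical sample standing in for the downloaded data.
RAW = """timestamp,PM10,NO2
2023-03-01 00:10,10.0,4.0
2023-03-01 00:40,20.0,
2023-03-01 13:00,30.0,8.0
2023-03-02 09:00,40.0,12.0
"""

rows = extract(RAW)
columns = [c for c in rows[0] if c != "timestamp"]
conn = sqlite3.connect(":memory:")
for period in PERIODS:
    load(conn, f"air_quality_{period}", transform(rows, period), columns)

daily = conn.execute("SELECT * FROM air_quality_daily ORDER BY period").fetchall()
print(daily)
```

Keeping extract, transform, and load as separate functions mirrors the organized file structure the assignment asks for: each stage can live in its own module and be described on its own in the README.md.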

HW1 (due 27.03)

Data Source Exploration for Group Project

Identify a suitable data source for a group project and briefly describe its key attributes, such as data type, purpose, update frequency, ownership, and other relevant aspects. Further instructions

The proprietary copyrights of educational materials belong to the University of Tartu. The use of educational materials is permitted for the purposes and under the conditions provided for in the copyright law for the free use of a work. When using educational materials, the user is obligated to give credit to the author of the educational materials.
The use of educational materials for other purposes is allowed only with the prior written consent of the University of Tartu.