Introduction to Data Science (Sissejuhatus andmeteadusesse) - LTAT.02.002
This course gives a brief overview of the basic concepts, principles and practice of data science. The main goal is to learn to plan and carry out a simple practical data science project. The course covers the main methods for descriptive data analysis and visualization, frequent pattern mining, cluster analysis, principal components analysis, common methods of machine learning for classification and regression (including deep neural networks), managing data and interpreting results of statistical tests. The main stages of data science projects are discussed and available software tools reviewed. Homeworks are to be solved using the programming language Python 3 and its libraries.
Course information:
- We do not use Moodle in this course. All information is either here at the course homepage or announced at the course forum (link given below).
- The course will have pre-recorded lectures made available each week. There will be no practice sessions during the first week. The first practice sessions are on September 9 (groups 1 and 2), September 10 (groups 3, 4, 5, 6 and 7) and September 11 (group 8). The first homework is due on September 23 at noon.
- Lectures (Meelis Kull) - all will be pre-recorded. If you have any questions feel free to ask them on the course forum.
- Practice Sessions:
- Group 1: Monday 16:15 - 18:00 online (Victor Pinheiro) in English - - Please log in here on this course homepage to see the link for online participation.
- Group 2: Monday 16:15 - 18:00 Narva mnt 18, room 2010 (Carel Kuusk) in Estonian
- Group 3: Tuesday 10:15 - 12:00 Narva mnt 18, room 1008 (Hasan Tanvir) in English
- Group 4: Tuesday 10:15 - 12:00 Narva mnt 18, room 2048 (Friedrich Krull) in Estonian
- Group 5: Tuesday 12:15 - 14:00 Narva mnt 18, room 2048 (Markus Haug) in Estonian
- Group 6: Tuesday 12:15 - 14:00 Narva mnt 18, room 1022 (Hasan Tanvir) in English
- Group 7: Tuesday 16:15 - 18:00 online (Victor Pinheiro) in English - - Please log in here on this course homepage to see the link for online participation.
- Group 8: Wednesday 16:15 - 18:00 online (Victor Pinheiro) in English - - Please log in here on this course homepage to see the link for online participation.
Contacts:
- Course forum: https://campuswire.com/c/G4A59AEB9 . We will use Campuswire for questions and discussions. In the forum, you can post questions (also anonymously) about homeworks or course organization etc. And we can keep the discussion separate for different topics. By September 16, you should have all received a welcome e-mail that invites you to Campuswire - don't ignore it and register there (it is sent to your address that is in the study information system). If you somehow didn't get the e-mail then please ask your practice session teacher.
- Lecturer: Meelis Kull (meelis.kull@ut.ee)
- Teaching Assistants:
- Carel Kuusk
- Friedrich Krull (krullfriedrich@gmail.com)
- Hasan Tanvir (hasan.tanvir@ut.ee)
- Markus Haug
- Victor Pinheiro (victor.pinheiro@ut.ee)
Grading and requirements:
The grade is calculated from the total number of points (max 100). The points can be earned as follows:
- Homeworks (20 points): there will be 10 homeworks, each worth 2 points;
- Group project and presentation at the poster session (30 points);
- Written exam (50 points);
- Additional points can be earned from bonus tasks within homeworks;
- Attending at least 8 of the 10 practice sessions is compulsory: after missing 2 practice sessions, each additional missed practice session results in losing 2 points (except for medical reasons, then please contact the practice group's teacher)
In order to pass the course, the student must get at least 50% from homeworks after applying above attendance penalties (threshold 10 points), at least 50% from the project (threshold 15 points) and at least 50% from the exam (threshold 25 points).