Introduction to Data Science (Sissejuhatus andmeteadusesse) - LTAT.02.002
This course gives a brief overview of the basic concepts, principles and practice of data science. The main goal is to learn to plan and carry out a simple practical data science project. The course covers the main methods for descriptive data analysis and visualization, frequent pattern mining, cluster analysis, principal components analysis, common methods of machine learning for classification and regression (including deep neural networks), managing data and interpreting results of statistical tests. The main stages of data science projects are discussed and available software tools reviewed. Homeworks are to be solved using the programming language Python 3 and its libraries.
Course information:
- The course starts with the lecture on September 3, 2020. There will be no practice sessions during the first week. The first practice sessions are on September 7 (groups 1 and 2), September 8 (groups 5, 6 and 7) and September 9 (groups 3 and 4). The first homework is due on September 21 at noon.
- Lectures (Meelis Kull) - all will be held online only and recorded:
- Thursday 14:15 - 16:00, online - Please log in here on this course homepage to see the links for online participation in lectures and practice sessions.
- Practice Sessions - will be held physically and online simultaneously, and sometimes only online, please log in here on this course homepage to see the link for practice session location updates.
- Group 1: Monday 16:15 - 18:00 (Narva mnt 18, room 2047 + online ) (Victor Pinheiro) in English
- Group 2: Monday 16:15 - 18:00 (Narva mnt 18, room 2045 + online ) (Anna Aljanaki) in Estonian
- Group 3: Wednesday 12:15 - 14:00 (Narva mnt 18, room 1021 + online ) (Markus Kängsepp) in Estonian
- Group 4: Wednesday 12:15 - 14:00 (Narva mnt 18, room 2047 + online ) (Victor Pinheiro) in English
- Group 5: Tuesday 16:15 - 18:00 (Narva mnt 18, room 2010 + online ) (Anna Aljanaki) in Estonian
- Group 6: Tuesday 16:15 - 18:00 (Narva mnt 18, room 2034 + online ) (Victor Pinheiro) in English
- Group 7: Tuesday 10:15 - 12:00 (Narva mnt 18, room 1022 + online ) (Victor Pinheiro) in English
- Homework deadlines: Monday at noon (12:00)
NB! According to the university rules for the Delta building, every student must self-register each physical practice session attendance at http://cs.ut.ee/reg
Contacts:
- Course forum: https://piazza.com/ut.ee/fall2020/ltat02002
We will use Piazza for questions and discussions. In the forum, you can post questions (also anonymously) about homeworks or course organization etc. And we can keep the discussion separate for different topics. After the first lecture, you should all receive a welcome e-mail that invites you to piazza - don't ignore it and register there (it is sent to your address that is in the study information system SIS/ÕIS). If you somehow don't get the e-mail you can register here (just mark that you are a student and press "Join Classes"). Then you have to fill some information about yourself, which is a little annoying, but do it anyway. The home page of the course forum is here and you can click on Q&A to get to the forum part. That's it.
- Lecturer: Meelis Kull (meelis.kull@ut.ee)
- Teaching Assistants:
- Victor Pinheiro (victor.pinheiro@ut.ee)
- Anna Aljanaki (anna.aljanaki@ut.ee)
- Markus Kängsepp (markus.kangsepp@ut.ee)
Grading and requirements:
The grade is calculated from the total number of points (max 100). The points can be earned as follows:
- Homeworks (40 points): there will be 10 homeworks, each worth 4 points;
- Group project and presentation at the poster session (20 points);
- Written exam (40 points);
- Additional points can be earned from bonus tasks within homeworks;
- Attending at least 9 of the 12 practice sessions is compulsory (physical presence and online presence are both acceptable): after missing 3 practice sessions, each additional missed practice session results in losing 5 points. To keep track of your practice session attendances, login to the course webpage to find the link here.
In order to pass the course, the student must get at least 50% from homeworks (threshold 20 points), at least 50% from the project (threshold 10 points) and at least 50% from the exam (threshold 20 points). If having a written exam with physical presence turns out to be impossible due to the pandemic, then we will either organize an online exam or multiply the points from the homeworks and project by 1.67.
Links to previous courses:
2019 fall
2018 fall
2017 fall
2017 spring
2016
2015
2014
2013
2012
2011
2009