Arvutiteaduse instituut
  1. Kursused
  2. 2024/25 sügis
  3. Sissejuhatus andmeteadusesse (LTAT.02.002)
EN
Logi sisse

Sissejuhatus andmeteadusesse 2024/25 sügis

  • Home
  • Lectures
  • Practice sessions
  • Homeworks
  • Projects
  • Projects from 2021
  • Call for Data and Project Topics
  • Exam
  • Tutorials
  • Dictionary ENG-EST

Video about organisational information (17 min), slides
Video about what course this is (10 min), slides

Lecture 01 - Introduction (watch before practice sessions Sept 9-11)

Video (35 min), slides

Lecture 02 - First look at the data (watch before practice sessions Sept 16-18)

First look at an attribute: Video (48 min), slides
Distribution of attributes: Video (45 min), slides

Lecture 03 - Data visualization (watch before practice sessions Sept 23-25)

Video (1h 45min), slides -- This lecture is by Raivo Kolde, Associate Professor of Health Informatics.

Optional additional material - Tableau and visualisation: Video 1, video 2

Lecture 04 - Frequent pattern mining (watch before practice sessions Sep 30-Oct 2)

Video (90 min), slides

Lecture 05 - Relations of attributes, clustering and dimensionality reduction (watch before practice sessions Oct 7-9)

Relations of attributes: Video (27 min), slides
Clustering: Video (42 min), slides
Dimensionality reduction: Video (21 min), slides

Lecture 06 - Introduction to machine learning (watch before practice sessions Oct 14-16)

Example prediction task: Lenses: Video (7 min), slides
Supervised learning terminology: Video (3 min), slides
Majority class classifier: Video (4 min), slides
Decision tree learning: Video (33 min), slides
Classification with K nearest neighbours: Video (4 min), slides
Example: hand-written digit recognition: Video (6 min), slides
Curse of dimensionality: Video (4 min), slides

Lecture 07 - Machine learning 2 (watch before practice sessions Oct 21-23)

Example: Decision tree on image data: Video (6 min), slides
Random forest: Video (4 min), slides
Example: Random forest on image data: Video (6 min), slides
Example: state of water: Video (11 min), slides
Linear classification and support vector machine: Video (15 min), slides
Underfitting and overfitting: Video (12 min), slides
Hyperparameter tuning and cross-validation: Video (11 min), slides
Machine learning pipeline: Video (3 min), slides
Learning on imbalanced data: Video (13 min), slides

Lecture 08 - Machine learning 3 (watch before practice sessions Oct 28-30)

Tradeoff between true positives and false positives: Video (8 min), slides
Scoring classifiers and ROC curves: Video (19 min), slides
What is regression? Video (5 min), slides
Linear regression: Video (11 min), slides
Regularisation and regression: Video (19 min), slides

Lecture 09 - Machine learning 4 (watch before practice sessions Nov 4-6)

What is deep learning: Video (25 min), slides
Training neural networks: Video (33 min), slides
Convolutional neural networks: Video (5 min), slides
Machine learning landscape: Video (14 min), slides

Organisational information and suggested topics for projects

Video (23 min), slides

NB! The deadline to create a team and a slide briefly describing your chosen topic is on Nov 11, at noon (12:00). See details in the above video and slides.

Lecture 10 - Computational statistics (watch before practice sessions Nov 18-20)

Sample and population: Video (9 min), slides
Example task with red and black cards: Video (26 min), slides
Histogram on a sample vs population: Video (13 min), slides
Some statistical terminology: Video (13 min), slides

Lecture 11 - CRISP-DM and pitfalls in data analysis (watch before practice sessions Nov 25-27)

CRISP-DM: Video (28 min), slides
P-value and effect size: Video (11 min), slides
Regression to the mean: Video (5 min), slides
Correlation and causation: Video (14 min), slides
Bias in data: Video (10 min), slides
P-hacking: Video (5 min), slides

Lecture 12 - Databases and privacy (last lecture!)

Data sources: Video (12 min), slides
Business intelligence: Video (11 min), slides
Data architecture: Video (11 min), slides
Relational databases: Video (13 min), slides
Privacy and anonymization: Video (19 min), slides
Discrimination concerns: Video (5 min), slides
Data protection regulations: Video (5 min), slides

Temporary problems, all video links seem broken, please find the respective videos here: link

  • Arvutiteaduse instituut
  • Loodus- ja täppisteaduste valdkond
  • Tartu Ülikool
Tehniliste probleemide või küsimuste korral kirjuta:

Kursuse sisu ja korralduslike küsimustega pöörduge kursuse korraldajate poole.
Õppematerjalide varalised autoriõigused kuuluvad Tartu Ülikoolile. Õppematerjalide kasutamine on lubatud autoriõiguse seaduses ettenähtud teose vaba kasutamise eesmärkidel ja tingimustel. Õppematerjalide kasutamisel on kasutaja kohustatud viitama õppematerjalide autorile.
Õppematerjalide kasutamine muudel eesmärkidel on lubatud ainult Tartu Ülikooli eelneval kirjalikul nõusolekul.
Courses’i keskkonna kasutustingimused