Institute of Computer Science
  1. Courses
  2. 2018/19 spring
  3. Machine Learning II (LTAT.02.004)
ET
Log in

Machine Learning II 2018/19 spring

Previous years: 2008 » 2012 » 2013 » 2014

  • Main
  • Lectures
  • Exercise sessions
  • Grading
  • Piazza forum
  • Upload

I. Performance evaluation measures

Given by Sven Laur

Brief summary: Theoretical formalisation of performance measures. Loss function and risk. Empirical risk and its convergence to risk. Confidence intervals for risk estimates and the size of the hold-out test set. Why training error is biased. Optimism as a way to correct training error. Bias-variance dilemma and crossvalidation as an engineering solution. Properties of k-fold crossvalidation. Moment matching and naive confidence intervals for the crossvalidation error. Why the naive estimate underestimates the variance of the crossvalidation error. Bootstrap as an alternative to crossvalidation.

Literature

  • Bousquet, Boucheron and Lugosi: Introduction to Statistical Learning Theory
  • Bengio and Grandvalet: No Unbiased Estimator of the Variance of K-Fold Cross-Validation
  • Jüri Lember:Tehisõppe loengukonspekt
  • Friedman, Tibshirani and Hastie: The Elements of Statistical Learning
  • Institute of Computer Science
  • Faculty of Science and Technology
  • University of Tartu
In case of technical problems or questions write to:

Contact the course organizers with the organizational and course content questions.
The proprietary copyrights of educational materials belong to the University of Tartu. The use of educational materials is permitted for the purposes and under the conditions provided for in the copyright law for the free use of a work. When using educational materials, the user is obligated to give credit to the author of the educational materials.
The use of educational materials for other purposes is allowed only with the prior written consent of the University of Tartu.
Terms of use for the Courses environment