Institute of Computer Science
  1. Courses
  2. 2018/19 spring
  3. Machine Learning II (LTAT.02.004)
ET
Log in

Machine Learning II 2018/19 spring

Previous years: 2008 » 2012 » 2013 » 2014

  • Main
  • Lectures
  • Exercise sessions
  • Grading
  • Piazza forum
  • Upload

III. Performance evaluation measures

Given by Sven Laur

Brief summary: Principles of experiment design. Machine learning as minimisation of future costs. Overview of standard loss functions. Stochastic estimation of future costs by random sampling (Monte-Carlo integration). Theoretical limitations. Standard validation methods: holdout, randomised holdout, cross-validation, leave-one-out, bootstrapping. Advantages and drawbacks of standard validation methods

Slides: PDF

Videos:

  • Lecture on UTTV (2016)
  • Lecture on UTTV (2015)
  • Screencast on cross-validation

Literature:

  • Davison and Hinkley: Bootstrap Methods and Their Application
  • Molinaro, Simon and Pfeiffer: Prediction Error Estimation: A Comparison of Resampling Methods
  • Arlot and Celisse: A survey of cross-validation procedures for model selection
  • Efron: Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation
  • Efron and Tibshirani: Improvements on Cross-Validation: The .632+ Bootstrap Method
  • Wolfgang Härardle: Applied Nonparametric Regression: Choosing the smoothing parameter (Chapter 5)
  • Yang: Can the Strengths of AIC and BIC Be Shared?
  • van Erven, Grunwald and de Rooij:Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma

Complementary exercises:

  • Generate data form a simple linear or polynomial regression model and use various validation methods and report results:
    • Did a training method chose a correct model
    • Is there some differences when the correct model is not feasible?
    • Estimate bias and variance of a training method
    • Did a validation method correctly estimated expected losses
  • Try various classification and linear regression methods together with various validation methods report the results
    • Iris dataset
    • Computer Hardware Data Set
    • Housing Data Set
    • Datasets for testing linear regression models

Free implementations:

  • Boot package in R
  • Some methods in the rminer package in R
  • Institute of Computer Science
  • Faculty of Science and Technology
  • University of Tartu
In case of technical problems or questions write to:

Contact the course organizers with the organizational and course content questions.
The proprietary copyrights of educational materials belong to the University of Tartu. The use of educational materials is permitted for the purposes and under the conditions provided for in the copyright law for the free use of a work. When using educational materials, the user is obligated to give credit to the author of the educational materials.
The use of educational materials for other purposes is allowed only with the prior written consent of the University of Tartu.
Terms of use for the Courses environment