Institute of Computer Science
  1. Courses
  2. 2023/24 fall
  3. Special Course in Machine Learning: AI-Safety (MTAT.03.317)
ET
Log in

Special Course in Machine Learning: AI-Safety 2023/24 fall

Older Datamining Seminars: 2008k » 2008s » 2009k » 2009s » 2010k » 2011k » 2012s » 2014k » 2014s » 2014k

  • About
  • Points
  • Project
  • Additional materials
  • Timetable
  • Homeworks
  • Project ideas

Timetable

The video lecture are from David Silver's reinforcement learning course. Some homeworks are also from his course, but others are from Berkley deep reinforcement learning course.

DateVideo lectureAssignmentTest presenterHomework presenter
8.02.2016Introduction to Reinforcement Learning (slides)none  
15.02.2016Markov Decision Processes (slides) (test)Implementation of Easy21Zura IsakadzeAqeel Labash
22.02.2016Planning by Dynamic Programming (slides) (test)Value Iteration and Policy IterationKristjan JansonsLauri Tammeveski
29.02.2016Model-Free Prediction (slides) (test)Monte-Carlo Control in Easy21Irene TeinemaaGagandeep Singh
7.03.2016Model Free Control (slides) (test)TD Learning in Easy21Lauri TammeveskiKristjan Jansons
14.03.2016Value Function Approximation (slides) (test)Linear Function Approximation in Easy21Aqeel LabashÜllar Lindmaa
21.03.2016Policy Gradient Methods (slides) (test)Implement policy gradient methodArdi TampuuIrene Teinemaa
28.03.2016Integrating Learning and Planning (slides) (test)Implement some enhancement or variation on policy gradient optimization algorithmIlya KuzovkinZura Isakadze
4.04.2016Exploration and Exploitation (slides) (test)Experiment with Atari domainÜllar LindmaaIlya Kuzovkin
11.04.2016Case Study: RL in Classic Games (slides) (test)TORCS rally simulator and RLDaniel MajoralArdi Tampuu
18.04.2016Bonus video: Richard Sutton's introduction to reinforcement learning   
25.04.2016Bonus video: David Silver's lecture about deep reinforcement learning   
2.05.2016Bonus video: Nando De Freitas's lecture about reinforcement learning with direct policy search   
9.05.2016Bonus video: Nando De Freitas's lecture about reinforcement learning with action-value functions   
16.05.2016Torcs competition rehearsal   
17.05.2016Jaan Tallinn's lecture about AI control as reinforcement learning problem 15:15 in Paabel
Torcs competition after Jaan Tallinn's lecture
23.05.2016Mastering the game of Go with deep neural networks and tree search by Ilya Kuzovkin
  • Institute of Computer Science
  • Faculty of Science and Technology
  • University of Tartu
In case of technical problems or questions write to:

Contact the course organizers with the organizational and course content questions.
The proprietary copyrights of educational materials belong to the University of Tartu. The use of educational materials is permitted for the purposes and under the conditions provided for in the copyright law for the free use of a work. When using educational materials, the user is obligated to give credit to the author of the educational materials.
The use of educational materials for other purposes is allowed only with the prior written consent of the University of Tartu.
Terms of use for the Courses environment