Institute of Computer Science

Special Course in Machine Learning: AI-Safety (MTAT.03.317), 2023/24 fall

Project ideas

Multi-agent communication

Show that multiple agents can learn to communicate in a simple gridworld task. The project is based on the article "Cerebral coherence between communicators marks the emergence of meaning"; see also the movies in its supplemental material.

Supervisor: Jaan Aru
Difficulty: medium

Torcs racing game

Teach a car to drive in the TORCS racing game. You will need the TORCS Championship Server and its Python client. Installation and use of the championship server are documented in the manual.

Supervisor: Tambet Matiisen
Difficulty: easy

Teach computer to solve equations

Given three numbers A, B and C, choose an operation (op) so that A (op) B = C. For example, given A=2, B=3 and C=5, the correct operation is +. The state is the three integers, the action is one of +, -, * or /, and the reward is 1 when the chosen operation satisfies the equation and 0 otherwise.
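As a starting point, the task can be sketched as a tiny environment in Python. This is only an illustration, not a reference solution: the names `OPS`, `sample_task`, `reward` and `correct_actions` are hypothetical, and the number range and the choice to generate only integer quotients are assumptions. Note that some states admit more than one correct action (for example 2, 2, 4 works with both + and *), so the reward checks the equation instead of comparing against a single hidden operation.

```python
import operator
import random

# The four actions named in the task description.
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def sample_task(rng, low=1, high=9):
    """Return a state (A, B, C) that is solvable by at least one operation.

    Assumption: operands are drawn from [low, high] with low >= 1, and for
    division A is replaced by A * B so that C is an integer.
    """
    op = rng.choice(list(OPS))
    a, b = rng.randint(low, high), rng.randint(low, high)
    if op == "/":
        a = a * b  # guarantees an exact integer quotient
    c = int(OPS[op](a, b))
    return a, b, c


def reward(state, action):
    """Reward is 1 if A (action) B == C, otherwise 0."""
    a, b, c = state
    try:
        return 1 if OPS[action](a, b) == c else 0
    except ZeroDivisionError:
        return 0  # division by zero can never satisfy the equation


def correct_actions(state):
    """Brute-force baseline: list every action that earns reward 1."""
    return [action for action in OPS if reward(state, action) == 1]


if __name__ == "__main__":
    rng = random.Random(0)
    state = sample_task(rng)
    print(state, correct_actions(state))
```

A learning agent would replace `correct_actions` with a policy that only sees the state and the reward; the brute-force function is useful mainly for checking the agent's accuracy.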

Supervisor: Tambet Matiisen
Difficulty: too easy?

Experiment with MazeBase

MazeBase is an environment for creating simple 2D games and training neural network models to perform tasks within them. It was developed by the Facebook AI Research group specifically to give control over the difficulty of reinforcement learning problems. The system is described in a paper, and the code is on GitHub.

Supervisor: Tambet Matiisen
Difficulty: medium

Experiment with WebNav

WebNav is a task recently proposed by NLP researchers. In this challenging task, an agent navigates through a web site, represented as a graph with web pages as nodes and hyperlinks as directed edges, to find a web page in which a query appears. The example dataset is based on English Wikipedia. Alternatively, you could implement the game "5 Clicks to Jesus" on the same dataset and train an agent to play it.
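To make the graph formulation concrete, here is a minimal sketch of such a navigation task on a hand-made toy site. It does not use the WebNav code or dataset: `PAGES`, `WebNavEnv` and `shortest_click_path` are hypothetical names, the 0/1 reward is an assumption, and the default click limit of 5 simply echoes the "5 Clicks to Jesus" variant. The breadth-first search is a non-learning baseline that sees the whole graph, which a real agent would not.

```python
from collections import deque

# Hypothetical toy site: page name -> (page text, outgoing hyperlinks).
PAGES = {
    "Home": ("welcome to the toy wiki", ["Science", "History"]),
    "Science": ("physics chemistry biology", ["Physics", "Home"]),
    "History": ("ancient and modern history", ["Home"]),
    "Physics": ("quantum mechanics and relativity", ["Science"]),
}


class WebNavEnv:
    """Agent clicks hyperlinks until it reaches a page containing the query."""

    def __init__(self, pages, start, query, max_clicks=5):
        self.pages = pages
        self.start = start
        self.query = query
        self.max_clicks = max_clicks
        self.reset()

    def reset(self):
        self.page = self.start
        self.clicks = 0
        return self.page

    def actions(self):
        """Available actions are the outgoing links of the current page."""
        return list(self.pages[self.page][1])

    def step(self, link):
        if link not in self.actions():
            raise ValueError(f"{link!r} is not linked from {self.page!r}")
        self.page = link
        self.clicks += 1
        found = self.query in self.pages[self.page][0]
        done = found or self.clicks >= self.max_clicks
        return self.page, (1 if found else 0), done


def shortest_click_path(pages, start, query):
    """Breadth-first search for the fewest clicks to a page with the query."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        text, links = pages[path[-1]]
        if query in text:
            return path
        for nxt in links:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None  # no reachable page contains the query


if __name__ == "__main__":
    print(shortest_click_path(PAGES, "Home", "quantum"))
```

On the real dataset the graph is far too large to search exhaustively, which is why the task calls for a learned policy that picks links from the text of the current page.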

Supervisor: Tambet Matiisen
Difficulty: hard

Replicate DeepMind's Atari results

DeepMind's Atari paper sparked the current interest in deep reinforcement learning, and an improved version was later featured on the cover of Nature. Your job is to replicate their results. You will need the Arcade Learning Environment (especially its Python API), a toolkit for convolutional neural networks (for example Keras or Neon) and a lot of GPU power.

Supervisor: Tambet Matiisen
Difficulty: hard

Presentation about AlphaGo

Give an in-depth presentation about DeepMind's AlphaGo engine: how it works, which algorithms it uses and how they are combined.

Supervisor: Ilya Kuzovkin
Difficulty: medium

  • Institute of Computer Science
  • Faculty of Science and Technology
  • University of Tartu