Institute of Computer Science
  1. Courses
  2. 2018/19 spring
  3. Natural language processing (LTAT.01.001)
ET
Log in

Natural language processing 2018/19 spring

  • HomePage
  • Schedule
  • Labs
  • Reading Tests
  • Homeworks
  • Project
  • Results

Project

Every student must make a small project that contributes 30% to the final grade. The projects are individual, although several students can work on the same topic. This means that you can collaborate on the same topic but in the end everyone will submit their own individual work which cannot be identical to the work submitted by any other student.

There are two of options for choosing the project topic:

A. Choose the topic that you like

Because there are only four weeks for working on the project, the task cannot be too large. Also, make sure that suitable training data is freely available. If you choose that option, you can consult either me (Kairit, by e-mail or office hours) or Maksym (by e-mail, on Piazza, or after practice session) to make sure that your chosen task is suitable for the project.

For type A project, you need to do the following:

  1. Choose your topic.
  2. Find few (1-3) reference papers as related work.
  3. Implement your model and run experiments.
  4. Write a short report about your work.
  5. Submit the code and the report as your project.

B. Choose a model implemented in AllenNLP (https://allennlp.org/models)

For each model, the following resources are available: the paper describing the system and the code. Annotated training data is not available for every model, thus you can only choose from a subset of models. The following table describes possible options.

ModelData
Machine comprehensionSQuAD (available from AllenNLP)
Textual entailmentSNLI (available from AllenNLP)
Coreference resolutionOther datasets might be available, for instance https://preschool-lab.github.io/PreCo/
Named entity recognitionOther datasets might be available (there is also one in Estonian)
Dependency parsingUD datasets are available: https://universaldependencies.org/
Event2MindEvent2Mind corpus (available from AllenNLP)

For type B project, you need to do the following:

  1. Choose the model you like.
  2. Read the accompanying paper to understand what’s going on in the model.
  3. Re-train the model as it is on the original dataset or train it with another available dataset.
  4. Think about how you can change the model, implement the changes and rerun the experiments
  5. Write a short report about your work.
  6. Submit the code and the report as your project.

Suggested project timeline

Mandatory deadlines are in red.

Track A - custom projectTrack B - AllenNLP modelDate
Confirm the topicConfirm the topic10.05.2019
Read the paper(s), implement your modelRead the paper, retrain the AllenNLP model17.05.2019
Implement your model, experimentImplement changes, experiment24.05.2019
Implement your model, experimentImplement changes, experiment31.05.2019
Submit project code and reportSubmit project code and report07.06.2019

Confirming project topic

Confirm your topic, either by telling Maksym in the practice session or emailing Kairit and/or Maksym. We will insert them into the following table:

Getting help

You can get help from both Kairit and Maksym with both choosing the project topic and while working on the project.

  • Kairit: by email, office hours (room 416; if you want to come but cannot make it at those times then please let me know and we'll figure something out):
    • 08.05.2019 3:30-5pm
    • 16.05.2019 12-2pm
    • 23.05.2019 12-2pm
    • 30.05.2019 12-2pm
    • 05.06.2019 12-2pm
  • Maksym: by email, via Piazza, during practice sessions
  • Institute of Computer Science
  • Faculty of Science and Technology
  • University of Tartu
In case of technical problems or questions write to:

Contact the course organizers with the organizational and course content questions.
The proprietary copyrights of educational materials belong to the University of Tartu. The use of educational materials is permitted for the purposes and under the conditions provided for in the copyright law for the free use of a work. When using educational materials, the user is obligated to give credit to the author of the educational materials.
The use of educational materials for other purposes is allowed only with the prior written consent of the University of Tartu.
Terms of use for the Courses environment