Basic guidelines
Submission
- Homeworks are submitted and graded via the Gradescope website.
- Results should be submitted as a single PDF file. Submission in other formats will not be accepted.
- For homeworks written in the R language, you might find this RMarkdown homework template useful.
- Homeworks must be submitted by the deadline announced here and on the Gradescope website. Submissions received up to 1 day late will lose 50% of the points; later submissions will not be accepted. If you miss a deadline, you can earn extra points from bonus exercises that will be posted occasionally.
Plagiarism
The Bioinformatics course follows exactly the same rules that apply to the Data Mining course.
The homeworks are meant to be solved alone. If you struggle with something, you are welcome to post on the Piazza forum, attend the consultations, or ask the TAs directly. You are of course allowed to discuss the main ideas with other students, but you have to solve the tasks by yourself; it should not be a team effort. This is necessary because the course is strongly based on individual work: if you do not do the work yourself, you will not acquire the knowledge you need from the course.
During the course we usually use existing functionality in R and Python (or other tools) for the data mining algorithms, and you are not required to implement them yourself (there can be exceptions). If a considerable part of your solution is based on material you found online (more than just how to use some function), you must cite that source; otherwise it could be considered plagiarism.
So in principle:
- Do your work yourself.
- Do not share your work with others. If they need help, give them hints or guidelines.
- If you get a lot of help from some online source, cite it!
If you get caught with a clear case of plagiarism:
- If there are no previous problems, then depending on the situation you might get away with 0 points for the task and a warning from us.
- If the problem occurs repeatedly, or we know that you already have warnings from other courses, an official warning follows.
- If you already have other official warnings, this can lead to expulsion.
We will be using automatic plagiarism detection programs to deter and detect these situations!
Datasets
- Raw RNA-seq data
- Full gene expression dataset as a SummarizedExperiment object
- Small gene expression dataset in which some of the samples have been swapped
- Transcript expression estimates from Salmon
- BigWig files
- Eigengene values
Homeworks
- Homework 1 - Deadline 28 February 2018 @ 11:59PM
- Homework 2 - Deadline 9 March 2018 @ 11:59PM
- Bonus tasks 1 - Deadline 16 March 2018 @ 11:59PM
- Homework 3 - Deadline 16 March 2018 @ 11:59PM
- Homework 4 - Deadline 23 March 2018 @ 11:59PM
- Homework 5 - Deadline 6 April 2018 @ 11:59PM
- Homework 6 - Deadline 13 April 2018 @ 11:59PM
- Homework 7 - Deadline 20 April 2018 @ 11:59PM
- Homework 8 - Deadline 27 April 2018 @ 11:59PM