Single-Course English 5 ECTS

Algorithms in bioinformatics, PhD

Overall Course Objectives

To provide the student with an overview and in-depth understanding of bioinformatics machine-learning algorithms. Enable the student to first evaluate which algorithm(s) are best suited for answering a given biological question and next implement and develop prediction tools based on such algorithms to describe complex biological problems such as immune system reactions, vaccine discovery, disease gene finding, protein structure and function, post-translational modifications etc.

Learning Objectives

  • understand the details of the algorithms commonly used in bioinformatics.
  • develop computer programs implementing these algorithms.
  • identify which type of algorithm is best suited to describe a given biological problem.
  • understand the concepts of data redundancy and homology reduction.
  • develop bioinformatics prediction algorithms describing a given biological problem.
  • implement and develop prediction tools, on a detailed level, using the following algorithms: Dynamic programming, Sequence clustering, Weight matrices, Artificial neural networks, and Hidden Markov models.
  • design a project where a biological problem is analyzed using one or more machine learning algorithms.
  • implement, document and present the course project.

Course Content

The course will cover the most commonly used algorithms in bioinformatics. Emphasis will be on the precise mathematical implementation of the algorithms in terms of functional computer programs. During the course, biological problems will be introduced and analyzed with the purpose of highlighting the strengths and weaknesses of the different algorithms. The following topics will be covered:

Dynamic programming: Needleman-Wunsch, Smith-Waterman, and alignment heuristics
Data redundancy and homology reduction: Hobohm and other clustering algorithms
Weight matrices: Sequence weighting, pseudo count correction for low counts, Gibbs sampling, and Psi-Blast
Hidden Markov Models: Model construction, Viterbi decoding, posterior decoding, and Baum-Welsh HMM learning
Artificial neural networks: Architectures and sequence encoding, feed-forward algorithm, back propagation and deep neural networks

The course will consist of lectures, discussion sessions and computer exercises, where the students will be introduced to the different algorithms, their implementation and use in analyzing biological problems. In the end of the course, the student will work on a group project were one or more of the algorithms introduced in the course are applied to analyze a biological problem of interest. The project report shall be written as a research paper including an in-depth review of the field covered by the project.

Recommended prerequisites

22111/22110, or equivalent. Able to solve bioinformatic problems in a structured manner using simple programs and scripts in Unix environment, for example during project work.

Teaching Method

Lectures, discussions, exercises and project


See course in the course database.





3 weeks


Health Tech


DTU Lyngby Campus

Course code 22175
Course type PhD
Semester start Week 23
Semester end Week 26
Days Mon-fri 8:00-17:00

10.600,00 DKK