392107 | Parmigiani | Winter 2025/26 | Wednesday 10:15-11:45 | U10-146 |
This course is held in English.
Based on original research papers, the participants will give oral presentations (20-45 min) and write short summaries (5-10 pages) about algorithmic problems in bioinformatics and their solutions. Talks and essays must be done in English. The first day covers an overview of possible topics, which will then be distributed to the students. Aspects of scientific writing and presenting will be covered as well.
The overarching topic of this semester are k-mers (a.k.a. q-grams). This simple concept builds a basis for many algorithmic solutions in bioinformatics, such as assembly, alignment, genome comparison, pangenomics, etc.
To practice algorithm design and presentation, each participant will specify a simple k-mer counting algorithm and present it using pseudocode (a LaTeX template will be provided). Afterwards, they will implement the algorithm as a basic prototype. In a coding showdown, all implementations will battle it out to see which one takes the crown! (This can be credited as “392041 Implementation of Algorithms (Ü)”, 1 LP.)
Possible concrete methods/publications to be presented/discussed in the seminar are:
15.10. | – | |
22.10. | Organization, topic selection, Scientific reading | Slides: HowToRead |
29.10. | Pseudocode | algorithm2e-docu, LaTeX template (remove .pdf from file name), notes/example |
5.11. | – (self study) | Scientific Writing, Ten Simple Rules for Making Good Oral Presentations, Philip E Bourne, Ten simple rules for short and swift presentations, Christopher J. Lortie |
12.11. | – | |
19.11. | Pseudocode presentation | |
26.11. | k-mer counting challenge | Everybody |
3.12. | Mitan | Sparse and skew hashing of K-mers. |
10.12. | Kiril | Space-efficient and exact de Bruijn graph representation based on a Bloom filter. (Minia) |
17.12. | Nicola | Efficient q-gram filters |
14.1. | Tom | Tom Velvet: algorithms for de novo short read assembly using de Bruijn graphs. |
21.1. | Nils | Mash: fast genome and metagenome distance estimation using MinHash. |
28.1. | Mehran | Eulertigs: minimum plain text representation of k-mer sets without repetitions in linear time. |
4.2. | John | Fast gapped k-mer (cuckoo hash tables) |
25.2. | – | |
4.3. | – | |
11.3. | – | |
18.3. | – | |
25.3. | – |
Input:
Output: