This is an old revision of the document!


Bachelor and Master Thesis Topics within the AG Genome Informatics


The following is a (probably incomplete) list of thesis topics offered within the Genome Informatics group. If you are interested in working on one of these (or anything else in Genome Informatics), please contact the responsible group members or see Prof. Jens Stoye.

Alignment-free Phylogenomics with Copy Number Variations (Bachelor)

Andreas Rempel, Roland Wittler (Please also refer to the project page: SANS serif)

SANS serif is a software that allows the reconstruction of a phylogenetic network from a set of input genomes. For this purpose, the current implementation explores the presence/absence of k-mers in the DNA sequences. However, this method is not capable of detecting copy number variations, which would require counting and comparing the number of occurrences of each k-mer in each input sequence. The aim of this project is an efficient implementation of this extended method, but also an evaluation as to whether it improves the quality of the resulting phylogenetic network.
Good knowledge of C++ is highly recommended.

Improvement of Sequence-to-Graph Alignment (Bachelor)

Tizian Schulz (Please also refer to the project page: PLAST)

PLAST is a new heuristic method to find maximum scoring local alignments of a DNA query sequence to a pangenome represented as a compacted colored de Bruijn graph. The first method has been published here, but there exist various ideas how to improve the method. Some are well suited for a Bachelor thesis. Contact Tizian for details.

Medical Bioinformatics Projects in Australia (Master)

Jens Stoye, Lutz Krause (University of Queensland Diamantina Institute, Brisbane, Australia)

In collaboration with the Computational Clinical Genomics Group at the University of Queensland Diamantina Institute, Brisbane, Australia, we develop and apply bioinformatics and data-mining methods in the context of biomedical research. Student projects are available in the context of biomarker discovery for cancer, genome-wide epigenetic association studies, analysis of next-generation sequencing data, mining and visualizing the human microbiota, evolutionary genomics of human parasites, calling structural variants from next-generation sequencing data and genome-wide association studies (GWAS).