Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
thesistopics [2020/09/14 14:19]
rwittler
thesistopics [2025/03/31 17:32] (current)
arempel [<TITLE> (Bachelor/Master)]
Line 5: Line 5:
 </​WRAP>​ </​WRAP>​
  
-===== Medical Bioinformatics Projects in Australia (Master) ===== 
-[[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=65864|Jens Stoye]], [[https://​genomics.uq.edu.au/​profile/​263/​lutz-krause|Lutz Krause]] //​(University of Queensland Diamantina Institute, Brisbane, Australia)//​ 
  
-In collaboration with the Computational Clinical Genomics Group at the University of Queensland Diamantina InstituteBrisbaneAustralia, we develop and apply bioinformatics and data-mining methods in the context of biomedical researchStudent projects are available in the context of biomarker discovery for cancer, genome-wide epigenetic association studies, analysis of next-generation sequencing data, mining ​and visualizing the human microbiota, ​evolutionary ​genomics of human parasites, calling structural variants from next-generation sequencing data and genome-wide association studies (GWAS).+===== SAT solutions for rearrangement problems (Bachelor/​Master) ===== 
 +Genomic rearrangements play a critical role in evolution and adaptationaltering genome structure and organizationthereby influencing phenotypic traitsUnderstanding these rearrangements helps identify ​the mechanisms underlying many genetic diseases ​and evolutionary ​processes.
  
-===== Alignment-free Phylogenomics with Copy Number Variations (Bachelor) ===== +Howevereven quantifying rearrangements for most genomes that occur in practice is NP-hard under realistic models. 
-[[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=108064501|Andreas Rempel]][[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=3721521|Roland Wittler]] //(Please also refer to the project ​page: [[https://​gitlab.ub.uni-bielefeld.de/​gi/​sans|SANS serif]])//+While Integer Linear Programming ​(ILP) is commonly used to address this challenge, it has been recently observed that SAT formulations of the same problems may offer a faster alternative. The project ​involves converting an existing ILP solution for genomic rearrangement quantification into a SAT-based solution, leveraging SAT solvers'​ computational efficiency.
  
-SANS serif is a software that allows the reconstruction of a phylogenetic network from a set of input genomesFor this purpose, the current implementation explores the presence/absence of //k//-mers in the DNA sequencesHoweverthis method ​is not capable of detecting copy number variationswhich would require counting ​and comparing ​the number ​of occurrences of each //k//-mer in each input sequence. The aim of this project is an efficient implementation of this extended ​method, but also an evaluation as to whether ​it improves the quality of the resulting phylogenetic network.\\ Good knowledge of C++ is highly recommended.+Contact [[https://​ekvv.uni-bielefeld.de/pers_publ/publ/PersonDetail.jsp?​personId=129443261 |Leonard]] for details. 
 +===== Plasmid recovery/assembly from long reads (Bachelor/Master) ===== 
 +Despite advances ​in long-read sequencing technologies,​ plasmid assembly remains a significant challengeCurrent state-of-the-art assemblers often fail to recover plasmidsespecially those smaller than 10‎kb. This limitation ​is concerning because small plasmids often harbor important virulence and antimicrobial resistance genes. Popular assemblers such as Flye, Miniasm, Raven, and Canu show variable success, with recovery rates dropping dramatically for smaller plasmids. Furthermore,​ when plasmids are recovered, they often appear as multiple copies or are misassembled into the chromosome, highlighting fundamental limitations ​of current assembly algorithms. We are developing a novel pangenomic approach that uses gene identification within long reads and //k//-mers over the gene alphabet to improve the quality of the assembly graph. The main tasks for the students will be to analyze the performance ​of this method ​and to potentially improve ​it. ‎ Contact [[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=108064501|Andreas]] for details. 
 +===== <​TITLE>​ (Bachelor/​Master) ===== 
 +<Project description>​
  
-===== Alignment-free Phylogenomics using Amino Acid Sequences (Bachelor) ===== 
-[[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=108064501|Andreas Rempel]], [[https://​ekvv.uni-bielefeld.de/​pers_publ/​publ/​PersonDetail.jsp?​personId=3721521|Roland Wittler]] //(Please also refer to the project page: [[https://​gitlab.ub.uni-bielefeld.de/​gi/​sans|SANS serif]])// 
- 
-SANS serif is a software that allows the reconstruction of a phylogenetic network from a set of input genomes. For this purpose, the current implementation explores the presence/​absence of //k//-mers in the DNA sequences. However, for annotated genomes or to explore the evolutionary relationship of individual genes, it might be beneficial to perform the study at the level of the amino acid sequences. The aim of this project is an efficient implementation of this extended method, but also an evaluation as to whether it improves the quality of the resulting phylogenetic network.\\ Good knowledge of C++ is highly recommended.