The 16th Bioinformatics Research and Education Workshop

May 7-8, 2018, hosted by:

Bielefeld University, Germany Room X-B2-103

Map of University Campus here. (We are in building X.)

Map of X Building here. (We are in section B, floor 2, room 103.)

Monday 7.5.2018

8:30-9:00	Registration & Coffee
9:00-9:10	Welcome Address
9:10-9:40	Ahto Salumets: Comparative Genome-Wide Methylation Analysis of Regulatory and Conventional T Cells and Regulatory T Cells in Healthy Individuals and Graves' Patients
9:40-10:10	Elzbieta Gralinska: Combining Incoherent RNA-Seq Datasets on Example of Cerebral Organoids
10:10-10:40	Asan Meera Sahib Haja Mohideen: tRDFdbase: A Comprehensive Database of tRNA-Derived RNA Fragments
	Coffee Break
11:00-12:00	Keynote: Alessandro Mammana, Google Zurich: After the PhD, Chronicles Outside of Academia
	Lunch Break: Restaurant Nordlicht, X-E0
13:30-14:00	David Heller: SVIM: Structural Variant Identification Method using Long Reads
14:00-14:30	Bianca Frommer: Measuring the Extent of Haplotype Phase Accuracy in Genome Assemblies of Highly Heterozygous Organisms
14:30-15:30	Keynote: Alexander Sczyrba, Bielefeld University: Critical Assessment of Metagenome Interpretation (CAMI)
	Olderdissen Animal Park
19.30	Dinner at brewhouse Joh. Albrecht: Hagenbruchstraße 8, 33602 Bielefeld

Tuesday 8.5.2018

10:00-10:30	Coffee
10:30-11:00	Tõnis Tasa: External Evaluation of Population Pharmacokinetic Models for Vancomycin in Neonates with DosOpt
11:00-11:30	Francesco Delogu: Multiomics Network Representation of a Simplistic Microbial Community Time Series
11:30-12:30	Keynote: Michael Sammeth, Federal University of Rio de Janeiro: Functional Genomics in Organisms across the Tree of Life
	Lunch Break: Restaurant Nordlicht, X-E0
14:00-14:30	Riku Walve: Kermit: Guided Long Read Assembly using Coloured Overlap Graphs
14:30-15:00	Mikhail Papkov: Deep Learning for Cell Phenotyping
15:00-16:00	Keynote: Barbara Hammer, Bielefeld University: Feature Selection in the Context of Feature Correlations and Feature Redundancies
	End with Coffee

Abstracts of Keynote Talks

After the PhD, Chronicles Outside of Academia

by Alessandro Mammana

For some doctoral candidates, postdoctoral education and scientific research may appear as the natural continuation of their PhD in bioinformatics. For others, leaving the academic world behind and working in a company may look like a better opportunity. This talk is about my career path, which brought me from a PhD in bioinformatics to a job as a bioinformatics scientist working at Illumina and currently to a job as software engineer working at Google. I will discuss what I like about research and what I like about industry, the lessons that I learned and the opportunities that the PhD opened up for me.

Critical Assessment of Metagenome Interpretation (CAMI)

by Alexander Sczyrba

Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome data, but a lack of consensus about benchmarking complicates performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on complex data sets, generated from ~700 newly sequenced microorganisms and ~600 novel viruses and plasmids and representing common experimental setups. In my talk I will present results from the first CAMI challenge for assembly, binning and profiling tools. These results highlight current limitations of metagenomic tools, but also provide a roadmap for software selection to answer specific research questions.

Functional Genomics in Organisms across the Tree of Life

by Michael Sammeth

Ten years ago, RNA-seq turned up as a revolutionary tool for monitoring the transcriptome compositions of arbitrary cell types and states, opening up many novel possibilities for studies on functional genomics. This also gave rise to several multi-national projects to conduct RNA-Seq at large scale in different human cells, in order to advance with our understanding of how the genomic information influences our phenotypes, even after death. Furthermore, the combined RNA-sequencing of host cells together with pathogens (“Dual-Seq”) has recently demonstrated the capacity to monitor the cross-talk between two organisms during an infection. Functional studies in non-model organisms, however, are usually complicated primarily by their incomplete genomic and transcriptomic information. In this talk, we will discuss different possibilities to support RNA-seq studies by bioinformatics paradigms, and we discover some interesting biology along our way.

Feature Selection in the Context of Feature Correlations and Feature Redundancies

by Barbara Hammer

Feature selection methods aim for an automated identification of the most relevant characteristics for a given regularity based on given examples alone such as the automated identification of the potentially most relevant biomarkers in a biomedical diagnosis task. While a plethora of methods exists which can efficiently identify clear signals, the situation turns out more complicated in the realm of feature correlations and feature redundancies: on the one hand, it is not possible to exhaustively test subsets of features due to the involved combinatorial explosion; on the other hand, it is unclear how to interpret found feature relevances if features are potentially redundant hence the identified features are not unique.

In the talk, I will present a metric learning technology which can identify relevant sets of features via an adaptive feature relevance learning scheme which is integrated into the machine learning classifier. On the other hand, I will discuss effects which can occur in the case of redundant features, i.e. relevant features sets are not unique, and the results of standard feature selection technologies can be highly misleading. I present the so-called all relevant feature selection problem, and present novel machine learning technology how to solve this exemplary biomedical applications.

Genome Informatics