Principal advisor: Robert Kofler
Few people are aware that our genomes are filled with parasites. These so called transposable elements (TEs) selfishly spread in our genomes and may even cause diverse diseases. TEs have been remarkably successful: they occur in all eukaryotic species and constitute about 50% of our genomes.
TEs are also known to frequently invade novel species by horizontal transfer. They rapidly spread within afflicted populations, while the hosts struggle to contain their spread in order to avert major damage. However, after some time most hosts succeed in controlling the invasion of a TE, but it is largely unknown how this is exactly achieved. To shed light on this we will investigate TE invasions at high resolution using cutting edge technologies such as small RNA sequencing, RNA-Seq, phenotypic assays and Pool-Seq.
This work will allow the student to shed light on the invasion of parasitic DNA within a young and international team, get in contact with cutting-edge technologies and receive a first-rate training in bioinformatics. The project will involve 60% lab-work and 40% computer work. Some wet-lab experience (e.g. PCR) and programming skills (e.g. R or Python) would be beneficial.
Principal advisor: Magnus Nordborg
It has long been argued that epigenetic transgenerational inheritance could play an important role in adaptation (as well as in breeding and human complex trait variation). However, while the existence of such inheritance is no longer in doubt (at least not in plants, where DNA methylation can clearly be inherited; see Grossniklaus et al. 2013; Heard and Martienssen 2014), evidence that it plays an adaptive role is mostly limited to geographic correlations between methylation and the environment that are suggestive of selection, but prove little (Verhoeven, vonHoldt, and Sork 2016).
Our recent studies in Arabidopsis thaliana have uncovered remarkable geographic patterns that will enable us to address this issue (Dubin et al. 2015; Kawakatsu et al. 2016). Analysis of the genomes, transcriptomes, and methylomes of over 1,000 natural inbred lines grown in the same environment revealed genome-wide variation in DNA methylation that was strongly correlated with the environment of origin of the lines — implying either that they carry an epigenetic memory of their ancestral environment, or that the divergence in methylation is simply due to genetics. These two explanations are not mutually exclusive. In support of the former, DNA methylation is known to be heritable; in support of the latter, genome-wide association studies (GWAS) and crosses revealed that much of the variation clearly has a genetic basis.
My lab is currently trying to determine the extent to which these two explanations are responsible for the observed patterns, i.e., their proximal cause. We also seek to elucidate their ultimate cause, i.e., the evolutionary mechanisms that lead to such strong environment correlations. Research will involve analysis of large genomic data sets, and the requisite skills are required.
- Dubin, Manu J., Pei Zhang, Dazhe Meng, Marie-Stanislas Remigereau, Edward J. Osborne, Francesco Paolo Casale, Philipp Drewe, et al. 2015. “DNA Methylation in Arabidopsis Has a Genetic Basis and Shows Evidence of Local Adaptation.” Genomics. eLife 4 (May). doi:10.7554/eLife.05255.
- Grossniklaus, Ueli, William G. Kelly, Bill Kelly, Anne C. Ferguson-Smith, Marcus Pembrey, and Susan Lindquist. 2013. “Transgenerational Epigenetic Inheritance: How Important Is It?” Nature Reviews. Genetics 14 (3): 228–35.
- Heard, Edith, and Robert A. Martienssen. 2014. “Transgenerational Epigenetic Inheritance: Myths and Mechanisms.” Cell 157 (1). Elsevier Inc.: 95–109.
- Kawakatsu, Taiji, Shao-Shan Carol Huang, Florian Jupe, Eriko Sasaki, Robert J. Schmitz, Mark A. Urich, Rosa Castanon, et al. 2016. “Epigenomic Diversity in a Global Collection of Arabidopsis Thaliana Accessions.” Cell 166 (2): 492–505.
- Verhoeven, Koen J. F., Bridgett M. vonHoldt, and Victoria L. Sork. 2016. “Epigenetics in Ecology and Evolution: What We Know and What We Need to Know.” Molecular Ecology 25 (8): 1631–38.
Principal advisor: Joachim Hermisson
In nature, diverged populations or incipient species can often still exchange genetic material. This is particularly interesting if species exchange adaptive genes. Recent research has shown that adaptive introgression is indeed an important source for new adaptations in many animal and plant species. In particular, it has been shown that there has been adaptive gene flow from archaic humans (such as Neanderthals or Denisovans) to ancient modern humans.
Haplotype patterns after adaptive gene introgression. The aim of the planned project is to characterize the expected footprint of adaptive introgression and to develop statistical methods to detect such footprints in genome-wide polymorphism data. It has previously been shown that tests based on haplotype patterns are the most powerful ones to detect events of recent adaptation in human populations. We therefore will focus on the haplotype patterns generated by adaptive introgression. We will address question such as: What is the distribution of introgression tract lengths around the beneficial allele for a single successful introgression event? How many distinct pieces of introgressed material do we expect to find – and at which distance to the beneficial allele? How do linked deleterious alleles (which may either hitch-hike or not) affect these results? We will use analytical theory based on branching processes and coalescent simulations. Knowledge about the haplotype pattern is very relevant, in particular, to link any theoretical results to data. In the context of the graduate school, we will cooperate with Christian Schlötterer, who will study the introgression of single hot-temperature-adapted genotypes into a cold-adapted population under the hot selection regime. We will also apply the methods to Human data sets.
Macroevolutionary dynamics of selfish DNA unravelled by third generation sequencing
Principal advisor: Robert Kofler
Few people are aware that our genomes are filled with parasites. These so called transposable elements (TEs) selfishly spread in our genomes and may even cause diverse diseases. TEs have been remarkably successful: they occur in all eukaryotic species and constitute about 50% of our genomes. Despite the fundamental importance of TEs little is known about their long term evolution.
In this work we will sequence populations of closely related Drosophila species using the most recent long read technologies (PacBio and Oxford Nanopore). This will allow to unravel the evolutionary dynamics of TEs with unprecedented resolution and shed light on the following questions:
- How fast are these parasites spreading in genomes?
- Are invasions of these parasites influencing the large-scale evolution of genomes (e.g. inversions, translocations)?
- How frequent is the invasion of a novel genomic parasite in the course of evolution?
- Can we also find evidence for the death of a TE?
This work will allow the student to get in contact with cutting edge sequencing technologies, shed light on the evolution of genomes within a young and international team and receive a first-rate training in bioinformatics. The project will involve 20% lab-work and 80% computer work. Some programming skills (e.g. R or Python) and some wet-lab experience (e.g. PCR) would be beneficial.
New methods for modelling and analysis of data from experimental evolution
Principal Advisor: Andreas Futschik
As sequencing data are becoming less and less expensive, experiments to investigate the genomic response to selection are carried out at larger and larger scales. At the Vienna Graduate School of Population Genetics such experiments are carried out using Drosophila as a model organism. These experiments typically involve replicate populations and look for instance at unusual allele frequency changes. A further typical feature of such experiments is that whole pools of individuals are sequenced together at several time points. With more and more data coming in, it became apparent that adaptive signals are often more complicated than initially anticipated. This leads to new interesting methodological challenges with the analysis of such data.
The goal of the project will be the evaluation of existing methods, as well as the development of new statistical methods that help to investigate the genetic response to selection pressure, as occurring for instance when populations adapt to new environments.
The successful applicant should have good programming skills. As this project will take place at the interface between statistics and population genetics, interest in both areas will be important. Some existing knowledge in one of these fields will be a plus, but can also be acquired during the course of the PhD studies.
Statistical inference concerning population genetic parameters from repeated genomic measurement data
Principal Advisor: Andreas Futschik
High dimensional data, such as those produced from DNA and RNA sequencing experiments lead to exciting new challenges for the field of statistics. The proposed project is connected to the modelling of such data. Existing statistical methods will need to be adapted and new methods developed.
This Ph.D. project will provide a motivated student with the opportunity to participate in such research, and to apply new methods to analyze real population genetic data.
Nowadays, population genetics relies to a large extent on the analysis of next generation sequencing data. Such data are challenging to handle as they are high dimensional. The sample size can be fairly large, when whole pools of organisms are simulated simultaneously. When individual organisms are sequenced separately, they tend to be considerably smaller. A further characteristic of such data is the complex error structure, with error components including the sampling from the population, genetic drift, as well as sequencing and alignment errors. If whole pools of individuals are sequenced, this would add a further component of variance. Statistical models have to take these error components properly into account, in order to provide valid conclusions. Previous publications by Futschik et al. deal with the proper estimation of population genetic parameters (such as the mutation rate, the effective population size, and the population recombination rate) under such a setting.
An important issue is to disentangle local features such as selection from global features such as demography using genomic data at repeated time points. One question will be to estimate the proportion of the genome affected by selection. Empirical Bayes in the context of mixture models could be a promising approach to tackle such questions. A further question will be to investigate how estimates of global parameters are affected by local model deviations e.g. due to selection. Subsequently we want to optimize the separation between local and global features in terms of the performance of estimates for global parameters.
The adaptive value of diversity produced by recurrent whole genome doubling
Principal advisor: Ovidiu Paun
Whole genome doubling (WGD) and hybridization profoundly shaped plant genome evolution. However, most neopolyploids show poor fitness and fail to establish. To be successful, first generation allopolyploids must quickly adjust their genome and function, thereby altering their ecological properties and adaptive success, as a function of their environment. The duplicated nature of polyploids buffers more effectively deleterious alleles and provides genome-wide opportunities for adaptive evolution. Recurrent origins of polyploids are widespread and provide natural replicates to study mechanisms of rapid adaptation to divergent environments.
This project will combine molecular and ecological investigations in a fairly young polyploid complex in Dactylorhiza, comprising sibling terrestrial orchids with divergent ecological preferences. Specifically, to complement ongoing analyses of the nature of the extant molecular diversity in the D. majalis complex, we will interrogate the adaptive value of this diversity within reciprocal transplant experiments in the Alps and Scandinavia. We will shed light on the links between genotype, epigenotype and environmental conditions, by focusing on the environmental sensitivity of gene expression (with RNAseq) and of post-transcriptional regulation by small RNAs (with smRNAseq), exploring also in detail the link between DNA methylation patterns and expression of duplicated genes.