Periodic Reporting for period 2 - GENECLOCKS (Reconstructing a dated tree of life using phylogenetic incongruence)
Reporting period: 2019-01-01 to 2020-06-30
The first goal of the GENECLOCKS project is to develop methods. Methods that systematically extract information on the pattern and timing of genomic evolution by explaining differences between gene trees. These methods will allow us to, for the first time, reconstruct a dated tree of life from genome-scale data. We use parallel programming and computer science algorithms to maximize the number of genomes analyzed.
The second goal of the GENECLOCKS project is to apply these methods to open problems.
In our 2018 publication ""Gene Transfers can date the Tree of Life"" we applied our method to thousands of gene families from a diverse set of organisms: 40 species of the oxygen-producing photosynthetic bacteria called cyanobacteria, 60 species of single-celled microorganisms called archaea, and 60 species of fungi.
For more details download our publication from here: http://rdcu.be/KrjA and see the article ""Chronological Clues to Life’s Early History Lurk in Gene Transfers"" describing it in Quanata Magazine: https://www.quantamagazine.org/chronological-clues-to-lifes-early-history-lurk-in-gene-transfers-20180424/
As shown in the attached figure, one of the dates we were able to better resolve was the emergence of the Asgard group of Archaea, which we believe gave rise to the Eukaryotes from pine trees to human beings.
We also published two important papers in collaboration with Tom Willimas at the University of Bristol that help to resolve the evolutionary history of Archeae and the position of Eukaryotes among them. In our paper published in PNAS ( https://www.pnas.org/content/114/23/E4602 ) we applied a new approach that harnesses the information in patterns of gene family evolution to find the root of the archaeal tree and to resolve the metabolism of the earliest archaeal cells, which lived over 3 billion years age. Our approach robustly distinguished between published rooting hypotheses, suggested that the first Archaea were anaerobes that may have fixed carbon via the Wood–Ljungdahl pathway, and quantifies the cumulative impact of the horizontal transfer on archaeal genome evolution. In a second paper (https://www.nature.com/articles/s41559-019-1040-x) we showed that eukaryotes consistently originate from within the archaea in a two-domains tree when due consideration is given to the fit between model and data. Our analyses support a close relationship between eukaryotes and Asgard archaea and identify the Heimdallarchaeota as the current best candidate for the closest archaeal relatives of the eukaryotic nuclear lineage.
We also developed and made available GeneRax (https://github.com/BenoitMorel/GeneRax) a parallel tool for species tree-aware maximum likelihood-based gene tree inference under gene duplication, transfer, and loss. It infers gene trees from their aligned sequences, the mapping between genes and species, and a rooted updated species tree."