Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS

First use of probabilistic programming for hard problems in statistical phylogenetics

Project description

A probabilistic approach to the statistical analysis of phylogenetics

Statistical analysis of phylogenetic models is currently one of the most active research areas in computational biology, with wide applications in the theory of evolution, epidemiology and forensics. Existing computational approaches to inference based on Markov Chain Monte Carlo (MCMC) methods are not so efficient as Sequential Monte Carlo (SMC) inference algorithms. The goal of the EU-funded PhyPPL project is to apply probabilistic programming to automatically generate SMC inference models for those phylogenetic problems difficult to solve with MCMC methods. Specifically, the project will design statistical inference algorithms for complex diversification models with variable tree topology and a trait-dependent branching process. To demonstrate the potential of the new algorithms, the project will use them to trace the impact of the Andean orogeny on neotropical biodiversity.

Objective

Statistical analysis of phylogenetic models is one of the most active areas of research in computational biology today with wide applications in the Theory of Evolution, epidemiology, forensics, etc. Current phylogenetic software packages limit the user to the set of phylogenetic models and inference strategies that have been pre-programmed in the tool. Inference under certain important phylogenetic models is very difficult with the Markov chain Monte-Carlo strategy implemented in current packages for phylogenetic analysis. The new paradigm of probabilistic programming, coming from computational statistics and theoretical computer science, solves the model expression problem and enables the user to implement novel inference methods. We utilize probabilistic programming to automatically generate Sequential Monte Carlo (SMC) inference machinery for MCMC-hard problems in phylogentics. SMC algorithms may be more efficient, provide unbiased solutions, and provide likelihoods estimates for model comparison.

The goal of the proposed research is to carry out some of the first applications of probabilistic programming to real-world problems of empirical interest in evolutionary biology. The objectives are (1) to design and implement statistical inference machinery for complex diversification models with variable tree topology and a trait-dependent branching process under probabilistic programming, (2) to do a pilot study on the applicability of this inference machinery by studying the effect of the orogeny of the Andes on Neotropical biodiversity, and (3) contribute to the design and implementation of a novel probabilistic programming language for phylogenetics, TreePPL, by utilizing the insights gained from (1) and (2).

We also propose dissemination and communication measures that target scientists and the general public throughout Europe and in particular new and aspiring EU member states.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

You need to log in or register to use this function

Keywords

Project’s keywords as indicated by the project coordinator. Not to be confused with the EuroSciVoc taxonomy (Fields of science)

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MSCA-IF - Marie Skłodowska-Curie Individual Fellowships (IF)

See all projects funded under this funding scheme

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

(opens in new window) H2020-MSCA-IF-2019

See all projects funded under this call

Coordinator

NATURHISTORISKA RIKSMUSEET
Net EU contribution

Net EU financial contribution. The sum of money that the participant receives, deducted by the EU contribution to its linked third party. It considers the distribution of the EU financial contribution between direct beneficiaries of the project and other types of participants, like third-party participants.

€ 203 852,16
Address
Frescativägen 40
SE 114 18 Stockholm
Sweden

See on map

Region
Östra Sverige Stockholm Stockholms län
Activity type
Research Organisations
Links
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

€ 203 852,16
My booklet 0 0