Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS

Accurate reconstruction of microbial genomes from the environment

Project description

Improved binning improves the quality of metagenome-assembled genomes

Human and environmental microbiomes are diverse microbial communities that play essential roles in human health and ecosystem functioning. Our understanding of their genes, products and dynamic interactions has increased with the use of metagenomics, the study of the structure and function of entire nucleotide sequences obtained from all the organisms in a sample. However, the quality of the metagenome-assembled genomes (MAGs) reconstructed from metagenome data suffers from challenges with binning, which is the grouping of sequences into species-wise bins. With the support of the Marie Skłodowska-Curie Actions programme, the Metagenome binning project plans to address the problems with a novel binning algorithm that improves the binning of low-abundance and highly conserved sequences.

Objective

Metagenome-assembled genomes (MAGs) obtained from metagenomics are of fundamental value to understanding diverse ecological niches of microbes such as the human gut, with applications in medicine, biotechnology, and climate science. However, the quality of MAGs constructed with state-of-the-art tools is often unsatisfactory and worse than the self-reported quality. The main source of error is binning, a computational step that groups sequences assembled from short sequencing reads (contigs) into species-wise bins. The two chief challenges are accurately binning (1) genomes with low abundance and (2) highly conserved regions. Due to cross-mapping of reads, the contigs from conserved regions appear to have abundances equal to the sum of the abundances of the related species or strains. As conventional binning tools all rely on clustering contigs according to their abundances across samples, conserved regions end up forming separate bins. Besides, most existing methods optimise quality measures (purity and completeness based on conserved marker genes) and assess the final quality on these very measures, leading to highly optimistic results. I aim to solve these problems by developing a binning algorithm that applies
i) linear mixture models using non-negative matrix factorization to account for cross-mapping,
ii) Poisson statistics to accurately model low abundance, and
iii) Bayesian statistics-based multinomial clustering to calculate bin numbers. Importantly, it does not require marker gene-based quality measures for binning.
By improving the binning of low-abundance and highly conserved contigs, this approach should yield more high-quality MAGs, thereby enhancing a multitude of downstream metagenomic analyses for all areas of microbiome research.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.
This project's classification has been validated by the project's team.

Keywords

Project’s keywords as indicated by the project coordinator. Not to be confused with the EuroSciVoc taxonomy (Fields of science)

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

HORIZON-TMA-MSCA-PF-EF - HORIZON TMA MSCA Postdoctoral Fellowships - European Fellowships

See all projects funded under this funding scheme

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

(opens in new window) HORIZON-MSCA-2022-PF-01

See all projects funded under this call

Coordinator

MAX-PLANCK-GESELLSCHAFT ZUR FORDERUNG DER WISSENSCHAFTEN EV
Net EU contribution

Net EU financial contribution. The sum of money that the participant receives, deducted by the EU contribution to its linked third party. It considers the distribution of the EU financial contribution between direct beneficiaries of the project and other types of participants, like third-party participants.

€ 189 687,36
Address
HOFGARTENSTRASSE 8
80539 MUNCHEN
Germany

See on map

Region
Bayern Oberbayern München, Kreisfreie Stadt
Activity type
Research Organisations
Links
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data
My booklet 0 0