Metabolic phenotypes are influenced by intrinsic and environmental factors that determine health status and disease risks of an individual or group. Measuring and modelling of the metabolites in an individual provides insights into disease factors and etiology that can used for personalised medicine. The analysis, however, is extremely demanding and subject to statistical and computational challenges. The PhenoMeNal (Phenome and Metabolome aNalysis) project addresses these challenges by providing a comprehensive and standardised e-infrastructure that supports data processing and analysis pipelines for the massive amounts of medical molecular phenotype data generated by metabolomics applications. As such, the PhenoMeNal infrastructure provides services to the European Biomedical Community enabling computation and analysis to improve the overall understanding of the causes and mechanisms underlying health, healthy ageing and disease.
The PhenoMeNal infrastructure can also be easily reused in other domain fields, requiring only the implementation of container images and wrappers for the tools of the desired domain (proteomics, genomics, astronomy, etc), according to our guidelines. It covers all layers of the data analysis workflow, from the earliest point of data acquisition to the generation of scientific knowledge. With standardised and well-tested workflows, accessible through the PhenoMeNal Virtual Research Environment (VRE) portal, PhenoMeNal aims to enable frictionless data access to scientists with appropriate credentials by providing Findable, Accessible, Interoperable and Reusable (FAIR) datasets.
Patient and research subject data are very sensitive, and it is paramount importance to establish a robust governance framework for overall information management including sensitive data. The PhenoMeNal e-infrastructure also ensures that all data collected and held within the project complies with local laws, regulations and ethics.
The overall objectives of the project are:
1. To use existing open source community standards, integrate tools, resources and methods for the management, dissemination and computational analysis of very large datasets of human metabolic phenotyping and genomic data into a secure and sustainable e-Infrastructure
2. To operate and consolidate the PhenoMeNal e-infrastructure based on existing internal and external HPC (high-performance computing), cloud, and grid resources, including the EGI and the EGI Federated Cloud, and to extend it to world-wide computational infrastructures;
3. To improve and scale-up tools used within the infrastructure to cope with very large datasets;
4. To establish technology for a water-tight audit trail for the processing of human metabolic phenotyping data from the raw data acquisition all the way to the generation of high-level biomedical insights (such as a medical diagnosis);
5. To establish privacy-protection methods that allow working with highly sensitive molecular phenotype data;
6. To foster the worldwide adoption of PhenoMeNal through a wide range of outreach, dissemination, networking and training activities;
7. To develop a model to ensure sustainability of the PhenoMeNal network.