European Commission logo
polski polski
CORDIS - Wyniki badań wspieranych przez UE
CORDIS

PhenoMeNal: A comprehensive and standardised e-infrastructure for analysing medical metabolic phenotype data

Periodic Reporting for period 2 - PhenoMeNal (PhenoMeNal: A comprehensive and standardised e-infrastructure for analysing medical metabolic phenotype data)

Okres sprawozdawczy: 2017-03-01 do 2018-08-31

Metabolic phenotypes are influenced by intrinsic and environmental factors that determine health status and disease risks of an individual or group. Measuring and modelling of the metabolites in an individual provides insights into disease factors and etiology that can used for personalised medicine. The analysis, however, is extremely demanding and subject to statistical and computational challenges. The PhenoMeNal (Phenome and Metabolome aNalysis) project addresses these challenges by providing a comprehensive and standardised e-infrastructure that supports data processing and analysis pipelines for the massive amounts of medical molecular phenotype data generated by metabolomics applications. As such, the PhenoMeNal infrastructure provides services to the European Biomedical Community enabling computation and analysis to improve the overall understanding of the causes and mechanisms underlying health, healthy ageing and disease.
The PhenoMeNal infrastructure can also be easily reused in other domain fields, requiring only the implementation of container images and wrappers for the tools of the desired domain (proteomics, genomics, astronomy, etc), according to our guidelines. It covers all layers of the data analysis workflow, from the earliest point of data acquisition to the generation of scientific knowledge. With standardised and well-tested workflows, accessible through the PhenoMeNal Virtual Research Environment (VRE) portal, PhenoMeNal aims to enable frictionless data access to scientists with appropriate credentials by providing Findable, Accessible, Interoperable and Reusable (FAIR) datasets.
Patient and research subject data are very sensitive, and it is paramount importance to establish a robust governance framework for overall information management including sensitive data. The PhenoMeNal e-infrastructure also ensures that all data collected and held within the project complies with local laws, regulations and ethics.
The overall objectives of the project are:
1. To use existing open source community standards, integrate tools, resources and methods for the management, dissemination and computational analysis of very large datasets of human metabolic phenotyping and genomic data into a secure and sustainable e-Infrastructure
2. To operate and consolidate the PhenoMeNal e-infrastructure based on existing internal and external HPC (high-performance computing), cloud, and grid resources, including the EGI and the EGI Federated Cloud, and to extend it to world-wide computational infrastructures;
3. To improve and scale-up tools used within the infrastructure to cope with very large datasets;
4. To establish technology for a water-tight audit trail for the processing of human metabolic phenotyping data from the raw data acquisition all the way to the generation of high-level biomedical insights (such as a medical diagnosis);
5. To establish privacy-protection methods that allow working with highly sensitive molecular phenotype data;
6. To foster the worldwide adoption of PhenoMeNal through a wide range of outreach, dissemination, networking and training activities;
7. To develop a model to ensure sustainability of the PhenoMeNal network.
1. Establishment of VRE portal based on our iterative UX design process
2. 1st PhenoMeNal release (codename: Alanine) with two example use cases demonstrating the two initial services Galaxy (standardised workflows) and Jupyter (exploratory data analysis).
3. Industry-grade orchestration and deployment
4. VRE Deployment on Amazon, Google Cloud as well as public and private OpenStack clusters
5. PhenoMeNal Service catalogue “App Library" with 60 metabolomics tools
6. Various data management software tools developed in the ISA API, mzml2isa and nmrml2isa tools
7. Established Galaxy public instance hosted at the EMBL-EBI EMBASSY cloud
8. Comprehensive documentation and tutorials for both developers and end users available via website and portal
9. Help desk for users
10. First working versions of various analysis pipelines: Sacurine data analysis use-case, Fluxomics use-case and OpenMS Uppsala use-case
11. Cloud-ready VMI’s for the Galaxy and Jupyter web applications.
12. Established Process for checking ELSI compliance for datasets
13. Extensive outreach in workshops, hackathons, video tutorials
14. Project progress in line with KPI’s
15. A local installation of the Phenomenal compute environment at Imperial College London for processing sensitive data
16. 2nd major release with the name Bucetin was made public in August 2018. This release, included over 150 tools and 4 pre-assembled workflows.
17. An exhibition booth was also hosted during the MetaboMeeting 2017 conference in Birmingham. The booth facilitated recruitment of participants for the UX testing scheduled for Spring 2018
18. The February 2018 release Cerebellin included 179 Galaxy modules/tools, 47 containers, 7 workflows and 2 Galaxy tours. It represented a major upgrade to Bucetin release.
19. The most current release during the project period Dalcotidine was made publicly available in August 2018. The release included over 250 Galaxy tools, 3 new pre-assembled workflows including a Galaxy visualization plugin to help users to visualise ISA-Tab datasets within Galaxy.
1. Provision of user friendly methods and tools for data analysis that can give novel biological and clinical insights, providing impact in the clinical and public health arena.
2. Usage of our cloud infrastructure in other domains only require implementation of simple wrappers for the respective tools.
3. Contribution to standardisation of Metabolomics computing in Europe and beyond.
4. Launched Metabolomics FAIR data node in GoFAIR. A 2-day strategic workshop entitled “Establishing a node for FAIRifying metabolomics and phenomics data in the European Open Science Cloud” was hosted by Leiden University (LU) in March 2017. As a successful outcome of the meeting, the “Metabolomics Implementation Network” of the European Open Science Cloud was launched.
5. Model system for ELIXIR cloud strategy – ELIXIR provides an excellent platform to enable interaction with other OMICS areas.
6. On path to be the official Metabolomics representative for EOSC
7. Established as a use case in ELIXIR. At the same time also Proteomics and GALAXY were added as new communities in ELIXIR, with both communities there are strong interactions with Metabolomics community.
8. European Metabolomics Infrastructure Foundation established. The most important recent action for the sustainability of PhenoMeNal has been the setting up the European Metabolomics Infrastructure Foundation in Summer 2018, for guiding the further development of PhenoMeNal.
9. ELIXIR Metabolomics community established
10. ELIXIR Implementation study on Metabolite identification granted (18 months, involving 11 ELIXIR nodes), kick-off meeting organised in Rotterdam on September 18/19 2018.
11. Galaxy community adopted cloud focused developments for inclusion in the standard Galaxy platform (K8S)
12. PhenoMeNal is represented in the Swedish ELIXIR node, which has lead to transfer of competence and the use of KubeNow for cloud contextualisation to other ELIXIR-SE projects. ELIXIR-SE is the member in the nordic Tryggve2 project for working with sensitive data within the nordic countries where PhenoMeNal has been selected as one of the case studies during 2017-2020
PhenoMeNal logo
Main Components of PhenoMeNal