Project description
Ultra-high resolution protein structure on the basis of NMR measurements
Nuclear magnetic resonance (NMR) spectroscopy is essential for protein structure analysis allowing the measurement of the dynamics and structure of a protein under nearly physiological conditions. Recent studies on exact nuclear Overhauser enhancements (eNOEs) have enabled distance measurements in proteins by NMR with an accuracy of 0.1 Å. The EU-funded DeepNOE project aims to develop the innovative model/algorithm that automatically transforms raw NMR measurements into high-resolution protein structures that reveal multiple simultaneously populated conformational states in atomic detail. The study will include the application of deep learning, a novel field in machine learning that has revolutionised data science and artificial intelligence.
Objective
Nuclear Magnetic Resonance (NMR) spectroscopy is one of the leading techniques for protein structure analysis. In contrast to other methods, NMR spectroscopy allows the measurement of the dynamics and structure of a protein under nearly physiological conditions, without the need for crystallization or freezing of a sample. Recent studies on exact Nuclear Overhauser enhancements (eNOEs), carried out in the laboratory of the host professor, have enabled distance measurements in proteins by NMR with accuracy of 0.1 Å. This allows to determine structures in solution and in living cells with unprecedented resolution.
This biophysical achievement creates an outstanding opportunity for a computer scientist (the fellow candidate) to develop the first-of-its-kind model/algorithm that automatically transforms raw NMR measurements into high-resolution protein structures that reveal multiple simultaneously populated conformational states in atomic detail. This problem will be tackled with the use of deep learning (DL), a novel field in machine learning that has emerged after 2010 and has revolutionized data science and artificial intelligence.
The project is divided into 3 parts. First, it is planned to investigate recent advances in DL to derive a model that extracts visual information from 2D and 3D NMR spectra. Afterwards, the proposed model will be integrated into CYANA to formulate a hierarchical DL/optimization routine, which automates all steps of protein structure solving. Finally, it is planned to explore the possibility of calculating protein structures directly from NOESY spectra, which constitutes a new protocol for protein structure solving by NMR spectroscopy.
Summing up, the proposed DL approach has the potential to reduce the time required to solve proteins with NMR from months/years to days, while delivering very high resolution, multi-state structures. We expect this project to open new avenues in structural biology and drug discovery.
Fields of science
CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.
CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques.
- natural sciencescomputer and information sciencesdata science
- natural sciencesbiological sciencesbiochemistrybiomoleculesproteins
- natural sciencesphysical sciencesopticsspectroscopyabsorption spectroscopy
- natural sciencescomputer and information sciencesartificial intelligencemachine learningdeep learning
- natural sciencesbiological sciencesmolecular biologystructural biology
Programme(s)
Funding Scheme
MSCA-IF-EF-ST - Standard EFCoordinator
8092 Zuerich
Switzerland