Periodic Reporting for period 2 - PROT-STRUCT-DISEASE (Protein Structure, Molecular Mechanisms and Human Genetic Disease: Beyond the Loss-of-function Paradigm)
Período documentado: 2022-12-01 hasta 2024-05-31
In Aim 1, we will use structural bioinformatics techniques to explore the properties of dominant-negative, gain-of-function and loss-of-function variants, and learn how they are related to protein structure and disease phenotypes. In Aim 2, we will perform deep mutational scanning experiments (DMS) on human disease genes, enabling us to measure effects and elucidate molecular mechanisms for tens of thousands of single amino-acid substitutions. Finally, in Aim 3, we will seek to improve the prediction and prioritisation of pathogenic and other phenotypically important variants. This will involve assessing current approaches for predicting variant effects, implementing our own computational variant prioritisation pipeline and meta-predictor, and using our new understanding of molecular mechanisms to integrate computational predictors and DMS data with structural and other protein-level features. We will also demonstrate the power of our approach in application to sequencing data from clinical and population studies.
For Aim 2, we have now performed complete DMS experiments for two of our originally stated targets, with multiple other targets now in various stages of progress. Analysis of this data has highlighted striking differences in its utility for dominant vs recessive genetic disorders, and has inspired a new biallelic approach to data interpretation that shows great benefit for recessive disease. Publications describing this work are aimed to be submitted very soon.
With respect to Aim 3, we have completed a pair of closely related studies where we investigated the relationships between a large number of DMS experiments, and the outputs of computational methods for predicting either variant pathogenicity, or protein stability changes upon mutation. We were able to identify specific tools as performing best for either task, and also showed how the correspondence with protein stability effects depends greatly on the type of DMS experimental assay. We have also released a new computational method for predicting the molecular mechanisms most likely to be associated with different human genes, which is crucial for our ongoing efforts to establish an improved, mechanism-centric variant assessment pipeline.
Our results so far suggest that non-loss-of-function variants are even more prevalent, and more difficult to predict the effect of, than we thought at the start of the project. By the end of the project, the knowledge we will have learned, the tools we will have developed, and the experimental data we will have measured, will lead to a great improvement in our ability to identify damaging genetic variants and thus diagnose human genetic disease.