Periodic Reporting for period 1 - EnzVolNet (COMPUTATIONAL EVOLUTION OF ENZYME VARIANTS THROUGH CONFORMATIONAL NETWORKS)
Reporting period: 2017-05-01 to 2019-04-30
Markov State Models were also applied to the second proposed enzyme model, TrpS, a more complex α2β2 heterodimer enzyme system catalyzing the condensation of indole and L-serine to form L-tryptophan. In this case, MSM failed to reproduce the conformational dynamics of the open-to-closed COMM domain transition leading to improved catalysis in the evolved variants. Therefore, the contingency plan based on metadynamics was applied. In particular, metadynamics in combination with path-collective variables were used to recover the free-energy profiles associated with the TrpS and TrpS0B2 evolved variant open-to-closed COMM domain transition, at different reaction intermediates. Similar to LovD, energy profiles revealed an stabilisation or population shift towards closed/active COMM domain conformations as the enzymatic reaction progresses. Moreover, mutations introduced in the stand-alone TrpS0B2 variant were able to retain and improve the conformational plasticity observed in the WT TrpS, thus explaining the higher catalytic activity of the stand-alone variant.
The second research goal was to develop new computational tools to identify which residues affect the enzyme active site conformational dynamics, unveiling the most important positions for stand-alone enzyme activity and substrate scope. Computational methods, especially Molecular Dynamics (MD) simulations, are particularly useful to recover the motions of enzymes critical for its function. However, the large number of degrees of freedom present in biomacromolecules, including enzymes, hampers the extraction of essential mo-tions. In this work, statistical or machine-learning techniques were used to learn functional relationships from MD simulation data without requiring a detailed model of the un-derlying physics or biological relations. In particular, a series of machine learning methods (Random Forest, logistic regression, support vector machines, xgboost classifier, etc.) were used to find new patterns from MD simulation data. Feature extraction from the best classification model lead to an understanding of the conformational rearrangements along the experimental directed evolution process. Finally, data gathered from previous work packages lead to the proposal of new point mutations for LovD and TrpB to test experimentally for improved catalytic activity.
The development of new efficient computational protocols to design active biocatalyst/enzyme mutants for a given reaction would reduce the experimental costs of the directed evolution process, as fewer designs would have to be tested, and therefore impact the EU industry competitiveness.