Periodic Reporting for period 5 - Protein Lego (Protein design from sub-domain sized fragments)
Reporting period: 2020-08-01 to 2021-08-31
Based on these observations we set out to develop a general protein design approach using the principle of recombination. Using sensitive sequence comparison methods, we identified more than thousand such evolutionary successful protein fragments and generated a database called Fuzzle (short for Fold Puzzle) that is easily accessible as a webserver. We further classified the fragments based on their associated functions. These fragments can now be used as building blocks in protein design. To test this, we chose a couple of fragments from different folds and tested them in recombination experiments. We were able to build stable hybrid proteins and even achieved the transfer of functional sites with particular fragments. Based on these insights we implemented a computational tool called Protlego that automates the generation of hybrid protein and their analysis.
Overall, the Protein Lego project enabled important insights into two related research fields, namely evolution and design or proteins. The databases and tools we provide enable protein scientists to explore evolutionary relationships and protein engineers to choose building blocks for the construction of new proteins. In fact, we provide a unique engineering methodology that offers an innovative new route to rationally design proteins by assembling existing protein pieces in a Lego-like manner. We have significantly contributed to increasing the general understanding of sequence-structure-function relations in proteins and our ability to design complex, custom-made proteins, which has impact simultaneously both in synthetic biology and bioengineering.
The ability to recombine subdomain-sized fragments is what we explored next. In particular we focused on a set of different protein folds and recombined fragments that originated from these. We successfully generated a number of stable hybrid proteins that were analysed in detail. In these designs we explored the option of transferring functional sites associated with a particular fragment and could show that this is a valid method for the introduction of binding properties. Additionally, we gained insights into evolutionary relationships of the investigated protein folds. Based on these insights we developed a tool to automate the design approach and to computationally test different options for recombination. This new tool called Protlego is publicly available (https://hoecker-lab.github.io/protlego) and allows a user to fetch fragments from the Fuzzle database, choose a particular pair of which to generate possible recombinants, and energetically score these. The program includes tools for subsequent analysis that we also made generally available in a stand-alone webserver (http://proteintools.uni-bayreuth.de).
We reported about the results and tools at many international conferences, workshops and seminars. The knowledge is further disseminated through publications as journal articles.