Periodic Reporting for period 1 - EvolSpliceKinetics (From co-transcriptional splicing kinetics to the evolutionary impact of exon and intron definition)
Reporting period: 2020-08-01 to 2022-07-31
Adding to this complexity, it is now known that introns located towards the start of the RNA molecule are often spliced out whilst transcription of regions further downstream is still in progress. This is referred to as “co-transcriptional” splicing, and it opens up completely new ways of thinking about the process. Rather than simply considering the end product of splicing (which regions are removed and which ones remain), one can now turn the spotlight on the dynamics of the process. How do splicing and transcription affect one another, given that they often happen simultaneously? Are all introns removed equally fast? Do the sequence signals that control splicing differ between introns with slow and fast splicing?
In this project, I have studied the dynamics of co-transcriptional splicing in the fruit fly Drosophila melanogaster, a species whose genes have widely varying exon-intron structures. I have found the dynamics of splicing to vary dramatically between introns. Moreover, the kinetics of how an RNA is transcribed appear to co-vary with the kinetics of its splicing.
In addition, the project led me to interact with scientists from a wide array of backgrounds. I realized how seriously research is often hampered by insufficient training of young researchers in statistical thinking. Hence, a further goal of the project became to implement interventions to address this challenge.
a seminar to students at the Faculty of Science of the University of Lisbon, and two practicals on an introductory bioinformatics course organized by Egypt Scholars, directed at students in Egypt.
I proceeded with a detailed characterisation of co-transcriptional splicing in the fruit fly, published as a co-first-author paper in the journal RNA (rna.078933.121v1) and presented at two national and three international conferences. SR varied drastically between introns and correlated with properties of the intron in unexpected ways. Moreover, Pol II tended to pause at different locations depending on SR. I used Bayesian modelling to explore different hypotheses for the mechanisms underlying these patterns. I concluded that the data could only be accounted for by a model in which the same intron can stochastically switch between different modes of splicing kinetics.
Next, I checked whether the frequency or evolutionary conservation of splicing-related sequence signals depended on SR. I failed to uncover any significant patterns. This could be because the data was insufficient for such data-hungry analyses, or because variation in SR is either not functionally relevant or not controlled through sequence signals.
In addition, I used three types of interventions to improve statistical thinking skills among biomedical researchers. Firstly, I designed an eight-week introductory statistics course. The course emphasized conceptual understanding, hands-on practice on real data and group work. I taught this course at the iMM in 2021, training approximately 50 early-career researchers. The course was repeated in the spring of 2022 as an online Arabic-language version, delivered in collaboration with Egypt Scholars. Both iterations of the course received overwhelmingly positive feedback.
Secondly, in June 2022, I organized an international summer school on applying modelling techniques to biological data. The summer school, funded by a Horizon 2020 grant, was attended by researchers from the iMM in Portugal, the Max-Delbrück Zentrum in Germany, the Weizmann Institute of Science in Israel, and the University of Oxford in the UK. The participants took part in five days of hands-on workshops, delivered by an international group of instructors.
Thirdly, I worked individually with researchers to help them better understand their data. This included the supervision of two PhD students, one Master’s student and one intern, as well as aiding several other researchers. This work has led to one co-first-author publication (10.3390/biomedicines10020199), with at least three other manuscripts in preparation.
In addition to the research work, through the interventions detailed above, over 100 early-career researchers in Portugal, Egypt, and elsewhere around the world have received training in how to think better about data. My focus now is to refine these interventions further and to reach more young scientists.