In the middle of the BrainBreaks project, we invented computational tools and mathematical models to solve problems developed during our investigations. These are developments that are beyond the state of the art in the field of genomics and neurogenesis.
1. Utilizing Artificial Intelligence to Decipher DNA Replication Direction. The extensive application of artificial intelligence has primarily centered around images. Particularly within the realm of pathology, numerous programs have been developed to learn cell composition, shape, and pattern counts within diseased tissues. They aid in determining the grade and type of diseases based on histology or radiology images. In a similar vein, genomics data share resemblances to image data to a certain extent. Analogous to pixels that define the resolution of a DNA-associated marker, genomics data encapsulate intensity, distribution, and density information just as imaging data does. Although this approach is common in the field of pathology, its implementation within genomics remains relatively unexplored. Recognizing this, we opted to leverage the capabilities of artificial intelligence to predict DNA replication direction. The dynamic nature of DNA replication patterns presents unique challenges. While a similar approach has been devised in the past for specific datatype linked to DNA replication, its technical intricacy has confined its adoption to only a few select laboratories worldwide. In our pursuit, we have chosen to establish an artificial intelligence program for an alternative approach- one that demands significantly lower technical prerequisites. Our aim is to make this approach accessible and widely utilized by researchers within the genomics domain.
Grounded in a convolutional neural network, our artificial intelligence is structured in three distinct phases. The first phase entails pinpointing genomic regions housing specific start and end features associated with DNA replication. In the subsequent phase, training data is generated, outlining regions containing one to several start and end features. Finally, the third phase involves training the model using the designated training data. With a resounding success, this network exhibits the capacity to ascertain DNA replication direction with an impressive confidence level exceeding 80%, alongside high reproducibility. To enhance accessibility, this network has been made available on an open-access GitHub repository.
2. Exploring a Theoretical Mathematical Model
Our journey in Aim 2 revealed a rather unexpected fact: very little is known about the clonal dynamics, neural progenitor cell pool size, and the factors- both intrinsic and extrinsic-that govern embryonic neurogenesis. Upon thorough literature research, we uncovered only a handful of papers that had experimentally investigated mice neurogenesis. Interestingly, these studies primarily focused on observations within the frontal cortex. These efforts led to the development of models based on the relative speed of DNA incorporation and the proportion of neural progenitor cells. However, none of these models managed to explain the pool size or account for the occurrence of clonal expansion at different branch points.
To address this intriguing puzzle, my laboratory is currently collaborating with a systems biology lab at the German Cancer Research Center, which is my host institute. Our goal is to craft a mathematical model for embryonic neurogenesis. Unlike existing approaches, our model hinges on the mutation frequency within DNA sequences. It seeks to unveil the molecular lineage of each mutation, pinpoint the earliest common ancestor, and identify the driving force that emerges at the branching point where clonal expansion takes place.
While this kind of approach is well-established in the realm of cancer genomics, it's quite novel when applied to normal tissues or embryos. Through this innovative approach, we aim to shed light on the intricate process of embryonic neurogenesis and unveil the underlying factors that contribute to its complexity. We anticipate this model will reveal the ground-breaking dynamic of embryonic neurogenesis.
3. Unveiling the Frequency of DNA Breaks in Neural Progenitor Cells
At the heart of BrainBreaks lies a pivotal endeavor: deciphering the frequency of DNA breaks within neural progenitor cells. This is the very core of our exploration. Currently, we're in the process of assembling the definitive materials required to directly measure DNA breaks within the developing brain, specifically focusing on neural progenitor cells. Our aim is to investigate these cells both in their natural, unstressed state and under conditions of stress. Our ambitious goal is to compile a comprehensive map showcasing the locations of DNA breaks that occur in vivo. We're excited to share that we're on track to complete this mapping endeavor by the conclusion of the BrainBreaks period. This accomplishment promises to provide us with profound insights into the world of neural progenitor cells and their intricate DNA dynamics.