Periodic Reporting for period 4 - cis-CONTROL (Decoding and controlling cell-state switching: A bottom-up approach based on enhancer logic)
Okres sprawozdawczy: 2021-12-01 do 2023-05-31
In this project we are addressing the enhancer decoding challenge from a single-cell perspective, broadly termed as single-cell regulatory genomics. Particularly, we are using a combination of in vivo massively parallel enhancer-reporter assays, single-cell genomics on microfluidic devices, computational modelling, and synthetic enhancer design. We first apply this strategy to decipher the enhancer code in melanoma, a relevant case study due to the presence of distinct melanoma cell states, and the finding that cells can switch between these states. We also broaden our experimental model systems by looking at epithelial-to-mesenchymal transition, as well as wound response models. Several technological components of this project include the establishment of droplet-based single-cell techniques (e.g. single-cell ATAC-seq and multi-omics); and the development of novel algorithmic and AI approaches to analyze single-cell epigenomics data, and to decipher enhancer logic. Ultimately, our aim is to build enhancer models that can be used to (1) prioritize enhancer mutations in cancer and normal genomes, and (2) to design and optimize enhancers that drive cell type specific reporter activity, so that they can be used in gene therapy applications, for example to activate a suicide gene in a specific cancer cell state.
We developed several new technologies, including a custom droplet microfluidics technique called HyDrop, to perform scATAC-seq and scRNA-seq at a cost that is 100x cheaper compared to commercial methods. We then benchmarked HyDrop scATAC-seq in a large international consortium. We also optimized massively parallel enhancer reporter assays and used them to measure the activity of melanoma enhancers in different states and of hepatocyte enhancers in vivo in the mouse liver.
Importantly, we developed multiple computational methods including a Python version of SCENIC, called pySCENIC; NextFlow pipelines to facilitate the use of SCENIC; a topic modeling approach, called cisTopic, to analyze scATAC-seq data; a single-cell viewer called SCope; a method to integrate scATAC-seq and scRNA-seq data on a virtual spatial template, called scoMAP; and finally SCENIC+ to infer enhancer-GRNs from sc-multiome data.
Exploitation and dissemination: our single-cell and spatial data sets are all publicly available as resources, both as raw and processed data (via GEO and SRA), and as online atlases in SCope. Our HyDrop protocols are all publicly available at protocols.io and are used by dozens of labs around the world. All our methods are available via GitHub where we also solved many dozens of issues to help users use our tools, and to improve them.
As final outcome of cis-CONTROL, we developed three strategies for the AI-driven design of synthetic enhancers. We evaluated the synthetic enhancers in human melanoma cell lines, and in vivo in Drosophila. The design process led again to a further understanding of enhancer logic, including the role of repressors versus activators, to balance enhancer activity. We achieved these results by the intricate combination of computational modeling and experimental testing.
Exploitation and dissemination: Publications were released on bioRxiv at the time of submission. All deep learning models have been made available via kipoi.org a resource to share trained deep learning models. Our results led to interactions with Pharma and Biotech industry, and to a Proof of Concept Grant, to further develop our synthetic enhancer design pipeline towards the design of enhancers for gene therapy.