Overall most of our initial aims were completed successfully.
We have completed the characterization of our newly-developed iTSC reprogramming paradigm and publish this new model in the prestigious journal cell stem cell (Benchetrit et al. 2015).
Next, we have proposed to generate a quadruple fluorescence knock-in reporter system to be able to sort out reprogrammable cells for transcriptional analysis and completed this task successfully. Currently we have a system that contains four reporters (Nanog-2A-EGFP- specific for iPSC reprogramming, Elf5-2A-EYFP-NLS- specific for TSC reprogramming, and Utf1-2A-tdTomato and Esrrb-2A-TagBFP- shared between TSC and iPSC reprogramming). This system will refer hereafter as “BYKE” system.
We examined the transcriptome (bulk RNA-seq and single cell RNA-seq), methylome (RRBS), chromatin accessibility (ATAC-seq), chromatin activity (ChIP-seq for H3K4me2 and H3K27ac) and genomic stability (CNVs) of BYKE cells undergoing reprogramming to induced pluripotent stem cells (iPSCs) and induced TSCs (iTSCs) along the reprogramming process. Using single-cell analysis, we revealed unique and previously unknown stage-specific markers, as well as markers for faithful (Tdgf1 for OSKM and Cd82 for GETM) and failed (Anx3 for OSKM) reprogramming process.
Chromatin accessibility and activity analyses identified many reprogramming blockers, such as Usf1/Usf2, Nrf2 and MafK, along with other oxidative stress response genes that significantly hinder both reprogramming systems but with different dynamics. These results suggest a massive occupancy on fibroblastic identity regions even after 9 days of reprogramming, especially in GETM reprogramming.
In accordance with that, the AP1/CREB/ATF binding sites, which their corresponding protein members act as somatic cell identity safeguards that block the reprogramming process to pluripotency are significantly more enriched in GETM reprogramming peaks compared to OSKM reprogramming peaks. In contrast, in regions that are open in fibroblasts and closed upon reprogramming, the binding sites of the AP1/CREB/ATF family of proteins are significantly more enriched in OSKM reprogramming compared to GETM reprogramming. These results explain the potent ability of OSKM to erase somatic cell identity, as well as the continued presence of MEF-like cells in GETM reprogramming, even at day 12 of the process, as we observed in the SC-RNA-seq analysis.
Lastly, by integrating all the data together, we illuminated key aspects that characterize each fate at all levels of regulation. Remarkably, by comparing both systems we show that from the onset of the reprogramming process, OSKM define regions that are important for the development of the heart and brain, the two most essential organs for the developing embryo. Mechanistically, we show that while GETM shut off the embryonic programs by DNA methylation, OSKM open these regions but retain them as inactive by eliminating the histone mark H3K27ac.
In conclusion, besides providing the first multi-layer characterization of cells undergoing reprogramming to the TSC state, our approach of conducting concomitant and comparative multi-omics analysis of cells acquiring both pluripotent and TSC states yield knowledge that cannot be revealed otherwise, such as when each reprogramming process is analyzed separately. We show that this unique approach of comparative parallel multi-omics analysis is a powerful tool to illuminate features of the nuclear reprogramming process as well as general and broad cellular plasticity principles.