Dissecting quantitative traits and their underlying genetic interactions via systematic genome editing

Project Information

SystGeneEdit

Grant agreement ID: 742804

DOI

10.3030/742804

Project closed

EC signature date 16 May 2017

Start date 1 November 2017

End date 31 October 2022

Funded under

EXCELLENT SCIENCE - European Research Council (ERC)

Total cost

€ 2 499 995,00

EU contribution

€ 2 499 995,00

2 499 995,00

Coordinated by

EUROPEAN MOLECULAR BIOLOGY LABORATORY
Germany

Periodic Reporting for period 4 - SystGeneEdit (Dissecting quantitative traits and their underlying genetic interactions via systematic genome editing)

Reporting period: 2022-05-01 to 2022-10-31

Understanding how genetic variation contributes to phenotypic diversity is one of the greatest challenges in biology and medicine. While we know that single nucleotides (letters) combine to form functional genetic elements (words), we can’t yet predict which variants in the sequence matter and how they combine to impact traits and disease risk (meaning). In this project, we used budding yeast to probe the functional relevance of natural single nucleotide polymorphisms (SNPs) in strains that evolved over millions of years. To profile the effects of each SNP, we developed a multiplexed precision CRISPR editing system where each cell in a culture receives a distinct SNP. Inside each cell, a guide RNA directs Cas9 to cut a target site which is repaired by a donor DNA encoding the desired SNP. We measured the effects of thousands of SNPs on cellular fitness across many environmental conditions impacting diverse cellular pathways. Our results revealed that all types of SNPs can impact fitness but those that change protein sequence are enriched in overall number and effect size.

The success of this project depended on pioneering several novel technologies. CRISPR editing with donor DNA is naturally inefficient in all organisms. To overcome this challenge, we developed new methods to enhance the efficiency of repair by recruiting donor DNA to the cut sites. To investigate the efficiency and fidelity of this system across the genome, we developed new whole-genome sequencing methods and analysis workflows. While most of the genome was edited precisely and efficiently with donor recruitment, some genomic regions were still prone to alternative, undesired edit outcomes. We developed computational models to predict the likelihood of these aberrant editing events across the genome based on sequence features. In addition, we developed methods to characterize more complex phenotypes, including a novel microscopy method that captures and analyzes over 10,000 microscopic images per second, and therefore scales with the complexity of natural and disease-associated genetic variation. Together, these technical achievements enabled investigating multi-modal effects of genetic variants across the entire genome.

Conclusion of the Action: The biological insights and technologies developed during this project have implications for eukaryotes far beyond the yeast model system. For example, our findings that multiple (not single) causal variants underpin many traits have important implications for genetic mapping studies in humans, where it is assumed that the majority of the detected genomic regions harbor a single causative variant. We also found that variants in the yeast system are active only under specific environmental conditions; translated to the human system, this means that the lifestyle of a person could be modified based on their genetic background to avoid a disease. Ultimately, large-scale unbiased detection of functional SNPs will provide important datasets to train and validate computational approaches to predict the impact of an individual’s genetic makeup on health and disease. Our computational models for difficult-to-edit regions should be applicable across organisms and help researchers improve the accuracy and safety of genome editing. Finally, our new gene editing and phenotypic screening technologies give researchers powerful new tools to explore how genetic variation impacts diverse cellular functions.

1. We improved MAGESTIC (Roy et al. 2018) by optimizing guide RNA and Cas9 expression to boost the efficiency of homology directed repair (HDR). We benchmarked donor recruitment against alternative HDR enhancement strategies, and combined those into a single, superior editing method termed (Roy et al., in preparation).
2. We developed methods to investigate the efficiency and fidelity of precision editing, including a fast, simple, cheap, and scalable method that produces sequencing-ready libraries directly from yeast cultures (Vonesch et al. 2021). We performed genome sequencing of thousands of edited strains and confirmed that the vast majority of clones carried the desired variants without off-target effects. We found that the likelihood of erroneous on-target structural variants was dependent on genome context, and constructed machine-learning models to predict these “hard-to-edit” regions, facilitating the development of improved editing methods (Li et al., in preparation).
3. We constructed variant pools and isolated thousands of strains, each with a different variant. We optimized phenotyping and analytic pipelines towards maximizing sensitivity for subtle effects. We profiled fitness across chemical, drug and nutritional perturbations, to investigate how variants are active under different environmental conditions. We found that most genes have not been linked to growth in these conditions in previous knockout screens. Their protein products show physical interactions with those of other genes implicated in these traits (Vonesch et al., in preparation). Many natural variants show genetic interactions with genes implied in the same trait by the KO screen. We used an improved MAGESTIC version to dissect QTL down to their causative nucleotide variants (Roy et al., in preparation).
4. Many SNPs only have an effect in the presence of another SNP. To enable systematic discovery of such genetic interactions, we developed a new method that allows single cells to obtain multiple edits. A key achievement was the development of a barcoding system capable of linking the barcodes from multiple rounds of editing, allowing the edit combinations to be read out by short-read sequencing (Roy et al., manuscript in preparation).
5. We introduced several novel methods that allow characterization of 1,000s of CRISPR perturbation effects within a single experiment. One of these methods, termed image-enabled cell sorting (Schraivogel et al. 2022), allows isolation of cells according to information from microscopic images at speeds up to 15,000 cells per second. This method will enable new types of experimental strategies and is compatible across organisms, from yeast to mammalian cells. The other method is a targeted, single-cell RNA-seq assay for the massively parallel molecular phenotyping of cells carrying genetic perturbations. We obtained rich molecular phenotyping information on the expression of ~200 genes across ~1,000 genetic perturbations (Schraivogel, Gschwind et al. 2020), and are now using it to assign functions to disease associated genetic variants (ongoing work).

- With the improvements to our editing platform MAGESTIC we now dramatically exceed capabilities of other CRISPR-based variant engineering methods.
- MAGESTIC 2.0 enabled us to generate high-quality natural variant libraries where > 95 % of strains contain the correct changes in genome sequence.
- We phenotyped libraries to assign functions to natural variants and dissected potential interactions between variants.
- We have developed a simplified whole-genome sequencing (WGS) library preparation workflow that skips the traditional genomic DNA isolation step.
- By sequence-validating thousands of edited strains, we identified identified and can now predict hard to edit regions of the genome.
- We implemented assays to query the effect of thousands of CRISPR perturbations on gene expression of microscopic phenotypes in single cells.

erc_fig.png

Periodic Reporting for period 4 - SystGeneEdit (Dissecting quantitative traits and their underlying genetic interactions via systematic genome editing)

Share this page Share this page on social networks

Download Download the content of the page