Periodic Reporting for period 1 - FuseSeq (Single molecule reconstruction for high-throughput, short-read sequencing technologies)
Reporting period: 2022-06-01 to 2023-11-30
To address this problem, this project aimed to test the feasibility of a method that would bridge the gap between long reads and high-throughput sequencing by developing a fragment labelling system compatible with high-throughput short-read sequencing technology. The labelling system consists of tags that can be incorporated into target nucleic acid sequences using a hyperactive transposase mutant. These tags can subsequently be used to create fragments from the target molecule while retaining information about each fragment's original position in that molecule, thus allowing the sequence of the original, individual target molecules to be reconstructed from the fragments.
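The reconstruction idea can be illustrated with a minimal sketch. It assumes a hypothetical tag design that the report does not specify: each inserted tag is unique, and after cleavage the two fragments flanking an insertion site both carry that tag, so neighbouring fragments can be chained by matching tags. The names `Fragment` and `reconstruct_molecules` are illustrative only, and the tag sequences are assumed to have been trimmed from the reads already.

```python
from typing import List, NamedTuple, Optional


class Fragment(NamedTuple):
    """One sequenced fragment with the tags found at its two ends (hypothetical model)."""
    left_tag: Optional[str]   # tag shared with the upstream neighbour; None at a molecule start
    seq: str                  # fragment sequence with tag sequences already trimmed
    right_tag: Optional[str]  # tag shared with the downstream neighbour; None at a molecule end


def reconstruct_molecules(fragments: List[Fragment]) -> List[str]:
    """Chain fragments by matching right_tag to left_tag; return one sequence per molecule."""
    # Index every non-starting fragment by the tag on its left end.
    by_left = {f.left_tag: f for f in fragments if f.left_tag is not None}
    molecules = []
    for start in (f for f in fragments if f.left_tag is None):
        parts = [start.seq]
        visited = set()
        current = start
        while current.right_tag is not None:
            tag = current.right_tag
            if tag in visited:
                raise ValueError(f"tag {tag!r} seen twice; tags are assumed unique")
            visited.add(tag)
            nxt = by_left.get(tag)
            if nxt is None:
                raise ValueError(f"no fragment found downstream of tag {tag!r}")
            parts.append(nxt.seq)
            current = nxt
        molecules.append("".join(parts))
    return molecules


# Fragments from two molecules, pooled and shuffled as they would be after sequencing.
pooled = [
    Fragment("t2", "CC", None),
    Fragment(None, "GGG", "t3"),
    Fragment("t1", "TTGA", "t2"),
    Fragment(None, "ACGT", "t1"),
    Fragment("t3", "AAA", None),
]
result = reconstruct_molecules(pooled)
```

In this toy model, the pooled fragments are resolved into the two original molecules even though their order in the input is arbitrary. A real implementation would also have to handle sequencing errors in tags, missing fragments, and strand orientation, none of which are modelled here.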
Several chemical strategies for fragment generation have been tested, and a number of them successfully mediated chemical cleavage of the probes. Because these approaches require extra steps and the introduction of new substances into existing sequencing library protocols, they might interfere with those protocols. To avoid this, an alternative strategy was also tested that uses a chemical modification to generate fragments in parallel with the routine sample amplification steps found in sequencing library preparation protocols. This fragmentation strategy allows our probes to be used in existing sample preparation protocols for sequencing library generation with only minute alterations. The final method should therefore be easy to include in other commercially available sample preparation approaches for next-generation sequencing, increasing its future accessibility for end users.
Finally, we tested the insertion of probes into nucleic acid target molecules by the transposase. We confirmed insertion using a commercially available formulation of the transposase enzyme; however, the achievable insertion frequency was lower than required for our final application. Results with an in-house-produced transposase mutant indicated insertion of the probe, opening the possibility of fine-tuning the insertion frequency to reach the levels required for the final application.
The project has thus successfully demonstrated the feasibility of all the crucial steps required for the method to be applied to complex samples.
For the method to be applicable to complex samples and to permit single-molecule reconstruction, further research is needed to achieve incorporation rates high enough to generate fragment sizes compatible with existing short-read sequencing methods. The results obtained with an in-house-produced transposase mutant show promise in addressing this issue, so future work will focus on pursuing this strategy and on adapting the method to different short-read sequencing methods.
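The link between incorporation rate and fragment size follows from simple arithmetic: cutting a molecule at n insertion sites yields n + 1 fragments, so the mean fragment length is the molecule length divided by n + 1. The sketch below shows this relation. The function names and the example numbers (a 10,000-base molecule, a 300-base target) are illustrative assumptions and are not taken from the project results.

```python
import math


def mean_fragment_length(molecule_length: int, n_insertions: int) -> float:
    """Mean fragment length when a molecule is cut at n_insertions sites."""
    if molecule_length <= 0 or n_insertions < 0:
        raise ValueError("molecule_length must be positive and n_insertions non-negative")
    # n cuts always produce n + 1 fragments, regardless of where the cuts fall.
    return molecule_length / (n_insertions + 1)


def min_insertions_for(molecule_length: int, max_mean_fragment_length: int) -> int:
    """Smallest number of insertions giving a mean fragment length at or below the target."""
    if molecule_length <= 0 or max_mean_fragment_length <= 0:
        raise ValueError("lengths must be positive")
    return max(0, math.ceil(molecule_length / max_mean_fragment_length) - 1)


# Illustrative numbers only: a 10,000-base molecule and a 300-base target mean.
needed = min_insertions_for(10_000, 300)
achieved = mean_fragment_length(10_000, needed)
```

This covers only the mean. Because insertion positions vary, individual fragments can be much longer or shorter than the mean, so a practical target would likely need a higher insertion frequency than this lower bound suggests.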