Identification and quantification of low-frequency mutations remain tantalizing despite improvements in the baseline error rate of subsequent-technology sequencing applied sciences. Here, we characterize a technique, termed SaferSeqS, that addresses these challenges by (1) effectively introducing identical molecular barcodes in the Watson and Crick strands of template molecules and (2) enriching aim sequences with strand-specific PCR. The manner achieves high sensitivity and specificity and detects variants at frequencies below 1 in 100,000 DNA template molecules with a background mutation rate of <5 × 10–7 mutants per base pair (bp). We demonstrate that it can evaluate mutations in a single amplicon or simultaneously in multiple amplicons, assess limited quantities of cell-free DNA with high recovery of both strands and reduce the error rate of existing PCR-based molecular barcoding approaches by >100-fold.
The sequencing data generated on this respect will likely be obtained from the European Genome–phenome Archive (accession amount EGAS00001005048).
The SaferSeqS bioinformatics pipeline is implemented in Python. The source code is on hand in a Zenodo repository (https://doi.org/10.5281/zenodo.4588264).
Shendure, J. et al. DNA sequencing at 40: past, most modern and future. Nature 550, 345–353 (2017).
McMahon, M. A. et al. The HBV drug entecavir—effects on HIV-1 replication and resistance. N. Engl. J. Med. 356, 2614–2621 (2007).
Robins, H. S. et al. Complete analysis of T-cell receptor β-chain vary in αβ T cells. Blood 114, 4099–4107 (2009).
Miller, W. et al. Sequencing the nuclear genome of the extinct woolly extensive. Nature 456, 387–390 (2008).
Bruijns, B., Tiggelaar, R. & Gardeniers, H. Vastly parallel sequencing tactics for forensics: a overview. Electrophoresis 39, 2642–2654 (2018).
Hoang, M. L. et al. Genome-wide quantification of uncommon somatic mutations in novel human tissues the spend of hugely parallel sequencing. Proc. Natl Acad. Sci. USA 113, 9846–9851 (2016).
Chiu, R. W. et al. Noninvasive prenatal analysis of fetal chromosomal aneuploidy by hugely parallel genomic sequencing of DNA in maternal plasma. Proc. Natl Acad. Sci. USA 105, 20458–20463 (2008).
Mattox, A. Okay. et al. Applications of liquid biopsies for most cancers. Sci. Transl. Med. 11, eaay1984 (2019).
Newman, A. M. et al. Integrated digital error suppression for improved detection of circulating tumor DNA. Nat. Biotechnol. 34, 547–555 (2016).
Dou, Y. et al. Real detection of mosaic variants in sequencing data without matched controls. Nat. Biotechnol. 38, 314–319 (2020).
Razavi, P. et al. High-depth sequencing unearths the sources of plasma circulating cell-free DNA variants. Nat. Med. 25, 1928–1937 (2019).
Meynert, A. M., Bicknell, L. S., Hurles, M. E., Jackson, A. P. & Taylor, M. S. Quantifying single nucleotide variant detection sensitivity in exome sequencing. BMC Bioinformatics 14, 195 (2013).
Kinde, I., Wu, J., Papadopoulos, N., Kinzler, Okay. W. & Vogelstein, B. Detection and quantification of uncommon mutations with hugely parallel sequencing. Proc. Natl Acad. Sci. USA 108, 9530–9535 (2011).
Bettegowda, C. et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci. Transl. Med. 6, 224ra224 (2014).
Cohen, J. D. et al. Mixed circulating tumor DNA and protein biomarker-based liquid biopsy for the sooner detection of pancreatic cancers. Proc. Natl Acad. Sci. USA 114, 10202–10207 (2017).
Cohen, J. D. et al. Detection and localization of surgically resectable cancers with a multi-analyte blood take a look at. Science 359, 926–930 (2018).
Phallen, J. et al. Insist detection of early-stage cancers the spend of circulating tumor DNA. Sci. Transl. Med. 9, eaan2415 (2017).
Springer, S. et al. A multimodality take a look at to data the management of patients with a pancreatic cyst. Sci. Transl. Med. 11, eaav4772 (2019).
Springer, S. et al. A combination of molecular markers and clinical functions give a settle to the classification of pancreatic cysts. Gastroenterology 149, 1501–1510 (2015).
Tie, J. et al. Circulating tumor DNA analysis detects minimal residual illness and predicts recurrence in patients with stage II colon most cancers. Sci. Transl. Med. 8, 346ra392 (2016).
Wang, Y. et al. Detection of somatic mutations and HPV in the saliva and plasma of patients with head and neck squamous cell carcinomas. Sci. Transl. Med. 7, 293ra104 (2015).
Wang, Y. et al. Detection of tumor-derived DNA in cerebrospinal fluid of patients with main tumors of the mind and spinal cord. Proc. Natl Acad. Sci. USA 112, 9704–9709 (2015).
Wang, Y. et al. Diagnostic doable of tumor DNA from ovarian cyst fluid. eLife 5, e15175 (2016).
Springer, S. U. et al. Non-invasive detection of urothelial most cancers through the analysis of driver gene mutations and aneuploidy. eLife 7, e32143 (2018).
Schmitt, M. W. et al. Detection of extremely-uncommon mutations by subsequent-technology sequencing. Proc. Natl Acad. Sci. USA 109, 14508–14513 (2012).
Schmitt, M. W. et al. Sequencing little genomic targets with high efficiency and extreme accuracy. Nat. Techniques 12, 423–425 (2015).
Samorodnitsky, E. et al. Review of hybridization prefer versus amplicon-based solutions for complete-exome sequencing. Hum. Mutat. 36, 903–914 (2015).
Chabon, J. J. et al. Integrating genomic functions for non-invasive early lung most cancers detection. Nature 580, 245–251 (2020).
Zheng, Z. et al. Anchored multiplex PCR for centered subsequent-technology sequencing. Nat. Med. 20, 1479–1484 (2014).
Makarov, V. & Laliberte, J. Enhanced adapter ligation. US patent 10,208,338B2 (2019).
Peng, Q. et al. Targeted single primer enrichment sequencing with single pause duplex-UMI. Sci. Uncover. 9, 4810 (2019).
Vogelstein, B. et al. Cancer genome landscapes. Science 339, 1546–1558 (2013).
Snyder, M. W., Kircher, M., Hill, A. J., Daza, R. M. & Shendure, J. Cell-free DNA contains an in vivo nucleosome footprint that informs its tissues-of-origin. Cell 164, 57–68 (2016).
Nachmanson, D. et al. Targeted genome fragmentation with CRISPR/Cas9 permits like a flash and atmosphere friendly enrichment of little genomic regions and extremely-real sequencing with low DNA input (CRISPR–DS). Genome Res. 28, 1589–1599 (2018).
Kennedy, S. R. et al. Detecting ultralow-frequency mutations by duplex sequencing. Nat. Protoc. 9, 2586–2606 (2014).
Rago, C. et al. Serial analysis of human tumor burdens in mice by the analysis of circulating DNA. Cancer Res. 67, 9364–9370 (2007).
Lennon, A. M. et al. Feasibility of blood testing mixed with PET–CT to display masks for most cancers and data intervention. Science 369, eabb9601 (2020).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://arxiv.org/abs/1303.3997 (2013).
Li, H. et al. The sequence alignment/arrangement format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Smith, T., Heger, A. & Sudbery, I. UMI-instruments: modeling sequencing errors in ordinary molecular identifiers to present a settle to quantification accuracy. Genome Res. 27, 491–499 (2017).
Jiang, P. et al. Detection and characterization of jagged ends of double-stranded DNA in plasma. Genome Res. 30, 1144–1153 (2020).
We thank the other folks who participated on this respect for their braveness and generosity. We also thank M. Hoang, S. Sur, A. Mattox, A. Pearlman and members of the Ludwig Heart at Johns Hopkins for insightful and priceless scientific discussions. We’re grateful to C. Blair and Okay. Deem for educated technical and administrative assistance and to E. Cook dinner for illustrative assistance. This work was supported by The Lustgarten Foundation for Pancreatic Cancer Research, The Marcus Foundation, The Virginia and D.Okay. Ludwig Fund for Cancer Research, The Conrad N. Hilton Foundation, The John Templeton Foundation, Medical Research Future Fund Investigator Grant (APP1194970) and National Institutes of Health grants (T32 GM007309, U01 CA230691-01, P50 CA228991, U01 CA200469, R37 CA230400-01, and U01 CA152753).
B.V., Okay.W.Okay. and N.P. are founders of Thrive and Private Genome Diagnostics and maintain equity in Staunch Sciences and Private Genome Diagnostics. Okay.W.Okay. and N.P. are consultants to Thrive. Okay.W.Okay. and B.V. are consultants to Sysmex and Eisai, and Okay.W.Okay., B.V. and N.P. are advisors to CAGE Pharma. B.V. is also a well informed to Catalio, and Okay.W.Okay., B.V. and N.P. are consultants to Neophore. C.D. is a well informed to Thrive and is compensated with profits and equity. The companies named above, as well to barely a kind of companies, have licensed previously described applied sciences linked to the work described on this paper from Johns Hopkins College. J.D.C., C.D., B.V., Okay.W.Okay., C.T. and N.P. are inventors on all these applied sciences. Licenses to these applied sciences are or will likely be linked to equity or royalty payments to the inventors as well to to Johns Hopkins College. Additional patent capabilities on the work described on this paper are being filed by Johns Hopkins College. The terms of all these arrangements are being managed by Johns Hopkins College in step with its war of interest insurance policies. The final authors uncover no competing pursuits.
Witness overview data Nature Biotechnology thanks Paul Spellman and the barely a kind of, anonymous, reviewer(s) for their contribution to the label overview of this work.
Author’s expose Springer Nature stays neutral in regards to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Cohen, J.D., Douville, C., Dudley, J.C. et al. Detection of low-frequency DNA variants by centered sequencing of the Watson and Crick strands.
Nat Biotechnol (2021). https://doi.org/10.1038/s41587-021-00900-z