Comparative evaluation of the heterozygous variant standard deviation as a quality measure for next-generation sequencing

被引:1
作者
Hansen, Marcus Hoy [1 ,2 ,3 ]
Lang, Cecilie Steensboe [1 ,2 ,3 ,4 ]
Abildgaard, Niels [1 ,2 ,3 ]
Nyvold, Charlotte Guldborg [1 ,2 ,3 ]
机构
[1] Univ Southern Denmark, Dept Haematol, Haematol Res Unit, Haematol Pathol Res Lab, Odense, Denmark
[2] Univ Southern Denmark, Dept Haematol, Pathol Res Unit, Odense, Denmark
[3] Odense Univ Hosp, Odense, Denmark
[4] Odense Univ Hosp, Dept Clin Pathol, Odense, Denmark
关键词
FRAMEWORK;
D O I
10.1016/j.jbi.2022.104234
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Next-generation sequencing holds unprecedented throughput in terms of informational content to cost. The technology has entered the scene in laboratory di-agnostics and offers flexible workflows in biomedical research. However, the rapid acquisition of genomic data also gives rise to a substantial fraction of sequencing artifacts, causing the detection of false-positive germline variants or erroneous somatic mutations. Consequently, there is a pressing need for efficient and practical quality assessment in sequencing projects. In this study, we investigate using heterozygous variant allele frequency (VAF) standard deviation (sigma) for supplementary quality control. Whereas several proposed quality metrics are based on empirical assessments, the dispersion of the allele frequencies reflects a direct approximation of the inherent and discrete features of a diploid genome. Consequently, homologous chromosomes display heterozygous VAF of approximately 1/2. Based on the meta-analysis of 152 whole-exome sequencing data sets, we found that a reflects both sequencing coverage and noise and can be effectively modeled. It is concluded that the relative comparison of heterozygous VAF sigma provides a practical handle for quality assessment, even for samples afflicted with copy-number alterations. The approach can be implemented when performing whole-exome, whole-genome, or targeted panel sequencing and helps identify problematic samples, such as those retrieved from archived formalin-fixed paraffin-embedded tissue.
引用
收藏
页数:7
相关论文
共 45 条
[1]   Dynamic molecular monitoring reveals that SWI-SNF mutations mediate resistance to ibrutinib plus venetoclax in mantle cell lymphoma [J].
Agarwal, Rishu ;
Chan, Yih-Chih ;
Tam, Constantine S. ;
Hunter, Tane ;
Vassiliadis, Dane ;
Teh, Charis E. ;
Thijssen, Rachel ;
Yeh, Paul ;
Wong, Stephen Q. ;
Ftouni, Sarah ;
Lam, Enid Y. N. ;
Anderson, Mary Ann ;
Pott, Christiane ;
Gilan, Omer ;
Bell, Charles C. ;
Knezevic, Kathy ;
Blombery, Piers ;
Rayeroux, Kathleen ;
Zordan, Adrian ;
Li, Jason ;
Huang, David C. S. ;
Wall, Meaghan ;
Seymour, John F. ;
Gray, Daniel H. D. ;
Roberts, Andrew W. ;
Dawson, Mark A. ;
Dawson, Sarah-Jane .
NATURE MEDICINE, 2019, 25 (01) :119-+
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]   The ENCODE Blacklist: Identification of Problematic Regions of the Genome [J].
Amemiya, Haley M. ;
Kundaje, Anshul ;
Boyle, Alan P. .
SCIENTIFIC REPORTS, 2019, 9 (1)
[4]   Targeted enrichment beyond the consensus coding DNA sequence exome reveals exons with higher variant densities [J].
Bainbridge, Matthew N. ;
Wang, Min ;
Wu, Yuanqing ;
Newsham, Irene ;
Muzny, Donna M. ;
Jefferies, John L. ;
Albert, Thomas J. ;
Burgess, Daniel L. ;
Gibbs, Richard A. .
GENOME BIOLOGY, 2011, 12 (07)
[5]   Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants [J].
Belkadi, Aziz ;
Bolze, Alexandre ;
Itan, Yuval ;
Cobat, Aurelie ;
Vincent, Quentin B. ;
Antipenko, Alexander ;
Shang, Lei ;
Boisson, Bertrand ;
Casanova, Jean-Laurent ;
Abel, Laurent .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (17) :5473-5478
[6]   Robustness of Next Generation Sequencing on Older Formalin-Fixed Paraffin-Embedded Tissue [J].
Carrick, Danielle Mercatante ;
Mehaffey, Michele G. ;
Sachs, Michael C. ;
Altekruse, Sean ;
Camalier, Corinne ;
Chuaqui, Rodrigo ;
Cozen, Wendy ;
Das, Biswajit ;
Hernandez, Brenda Y. ;
Lih, Chih-Jian ;
Lynch, Charles F. ;
Makhlouf, Hala ;
McGregor, Paul ;
McShane, Lisa M. ;
Rohan, JoyAnn Phillips ;
Walsh, William D. ;
Williams, Paul M. ;
Gillanders, Elizabeth M. ;
Mechanic, Leah E. ;
Schully, Sheri D. .
PLOS ONE, 2015, 10 (07)
[7]   Determining Performance Metrics for Targeted Next-Generation Sequencing Panels Using Reference Materials [J].
Cleveland, Megan H. ;
Zook, Justin M. ;
Salit, Marc ;
Vallone, Peter M. .
JOURNAL OF MOLECULAR DIAGNOSTICS, 2018, 20 (05) :583-590
[8]   Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation [J].
Costello, Maura ;
Pugh, Trevor J. ;
Fennell, Timothy J. ;
Stewart, Chip ;
Lichtenstein, Lee ;
Meldrim, James C. ;
Fostel, Jennifer L. ;
Friedrich, Dennis C. ;
Perrin, Danielle ;
Dionne, Danielle ;
Kim, Sharon ;
Gabriel, Stacey B. ;
Lander, Eric S. ;
Fisher, Sheila ;
Getz, Gad .
NUCLEIC ACIDS RESEARCH, 2013, 41 (06) :e67
[9]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[10]   Exome sequencing of extreme phenotypes identifies DCTN4 as a modifier of chronic Pseudomonas aeruginosa infection in cystic fibrosis [J].
Emond, Mary J. ;
Louie, Tin ;
Emerson, Julia ;
Zhao, Wei ;
Mathias, Rasika A. ;
Knowles, Michael R. ;
Wright, Fred A. ;
Rieder, Mark J. ;
Tabor, Holly K. ;
Nickerson, Deborah A. ;
Barnes, Kathleen C. ;
Go, Lung ;
Gibson, Ronald L. ;
Bamshad, Michael J. .
NATURE GENETICS, 2012, 44 (08) :886-+