Development of the variant calling algorithm, ADIScan, and its use to estimate discordant sequences between monozygotic twins

被引:3
作者
Cho, Yangrae [1 ,2 ]
Lee, Sunho [1 ,3 ]
Hong, Jong Hui [1 ,4 ]
Kim, Byong Joon [1 ]
Hong, Woon-Young [1 ]
Jung, Jongcheol [1 ]
Lee, Hyang Burm [2 ]
Sung, Joohon [5 ]
Kim, Han-Na [6 ]
Kim, Hyung-Lae [6 ]
Jung, Jongsun [1 ]
机构
[1] Syntekabio Inc, Techno-2ro B-512, Daejeon 34025, South Korea
[2] Chonnam Natl Univ, CALS, DFTBA, Gwangju 61186, South Korea
[3] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul 151742, South Korea
[4] Seoul Natl Univ, Coll Pharm, Res Inst Pharmaceut Sci, Seoul 08826, South Korea
[5] Seoul Natl Univ, Sch Publ Hlth, Dept Epidemiol, Complex Dis & Genome Epidemiol Branch, Seoul 08826, South Korea
[6] Ewha Womans Univ, Sch Med, Dept Biochem, Seoul 07985, South Korea
关键词
SOMATIC POINT MUTATIONS; GENETIC-VARIATION; GENOME; IDENTIFICATION; MOSAICISM; DISCOVERY; CANCER; RATES; DEEP; SNP;
D O I
10.1093/nar/gky445
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Calling variants from next-generation sequencing (NGS) data or discovering discordant sequences between two NGS data sets is challenging. We developed a computer algorithm, ADIScan1, to call variants by comparing the fractions of allelic reads in a tester to the universal reference genome. We then created ADIScan2 by modifying the algorithm to directly compare two sets of NGS data and predict discordant sequences between two testers. ADIScan1 detected >99.7% of variants called by GATK with an additional 724 393 SNVs. ADIScan2 identified similar to 500 candidates of discordant sequences in each of two pairs of the monozygotic twins. About 200 of these candidates were included in the similar to 2800 predicted by VarScan2. We verified 66 true discordant sequences among the candidates that ADIScan2 and VarScan2 exclusively predicted. ADIScan2 detected many discordant sequences overlooked by VarScan2 and Mutect, which specialize in detecting low frequency mutations in genetically heterogeneous cancerous tissues. Numbers of verified sequences alone were >5 times more than expected based on recently estimated mutation rates from whole genome sequences. Estimated post-zygotic mutation rates were 1.68 x 10(-7) in this study. ADIScan1 and 2 would complement existing tools in screening causative mutations of diverse genetic diseases and comparing two sets of genome sequences, respectively.
引用
收藏
页数:12
相关论文
共 45 条
[1]   One thousand somatic SNVs per skin fibroblast cell set baseline of mosaic mutational load with patterns that suggest proliferative origin [J].
Abyzov, Alexej ;
Tomasini, Livia ;
Zhou, Bo ;
Vasmatzis, Nikolaos ;
Coppola, Gianfilippo ;
Amenduni, Mariangela ;
Pattni, Reenal ;
Wilson, Michael ;
Gerstein, Mark ;
Weissman, Sherman ;
Urban, Alexander E. ;
Vaccarino, Flora M. .
GENOME RESEARCH, 2017, 27 (04) :512-523
[2]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[3]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[4]   Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma [J].
Ceccarelli, Michele ;
Barthel, Floris P. ;
Malta, Tathiane M. ;
Sabedot, Thais S. ;
Salama, Sofie R. ;
Murray, Bradley A. ;
Morozova, Olena ;
Newton, Yulia ;
Radenbaugh, Amie ;
Pagnotta, Stefano M. ;
Anjum, Samreen ;
Wang, Jiguang ;
Manyam, Ganiraju ;
Zoppoli, Pietro ;
Ling, Shiyun ;
Rao, Arjun A. ;
Grifford, Mia ;
Cherniack, Andrew D. ;
Zhang, Hailei ;
Poisson, Laila ;
Carlotti, Carlos Gilberto, Jr. ;
Tirapelli, Daniela Pretti da Cunha ;
Rao, Arvind ;
Mikkelsen, Tom ;
Lau, Ching C. ;
Yung, W. K. Alfred ;
Rabadan, Raul ;
Huse, Jason ;
Brat, Daniel J. ;
Lehman, Norman L. ;
Barnholtz-Sloan, Jill S. ;
Zheng, Siyuan ;
Hess, Kenneth ;
Rao, Ganesh ;
Meyerson, Matthew ;
Beroukhim, Rameen ;
Cooper, Lee ;
Akbani, Rehan ;
Wrensch, Margaret ;
Haussler, David ;
Aldape, Kenneth D. ;
Laird, Peter W. ;
Gutmann, David H. ;
Noushmehr, Houtan ;
Iavarone, Antonio ;
Verhaak, Roel G. W. .
CELL, 2016, 164 (03) :550-563
[5]   Genomic contributions to Mendelian disease [J].
Chakravarti, Aravinda .
GENOME RESEARCH, 2011, 21 (05) :643-644
[6]   Prevalence of Rare Genetic Variations and Their Implications in NGS-data Interpretation [J].
Cho, Yangrae ;
Lee, Chul-Ho ;
Jeong, Eun-Goo ;
Kim, Min-Ho ;
Hong, Jong Hui ;
Ko, Younhee ;
Lee, Bomnun ;
Yun, Gilly ;
Kim, Byong Joon ;
Jung, Jongcheol ;
Jung, Jongsun ;
Lee, Jin-Sung .
SCIENTIFIC REPORTS, 2017, 7
[7]   Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples [J].
Cibulskis, Kristian ;
Lawrence, Michael S. ;
Carter, Scott L. ;
Sivachenko, Andrey ;
Jaffe, David ;
Sougnez, Carrie ;
Gabriel, Stacey ;
Meyerson, Matthew ;
Lander, Eric S. ;
Getz, Gad .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :213-219
[8]   The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing [J].
Clement, Nathan L. ;
Snell, Quinn ;
Clement, Mark J. ;
Hollenhorst, Peter C. ;
Purwar, Jahnvi ;
Graves, Barbara J. ;
Cairns, Bradley R. ;
Johnson, W. Evan .
BIOINFORMATICS, 2010, 26 (01) :38-45
[9]   Variation in genome-wide mutation rates within and between human families [J].
Conrad, Donald F. ;
Keebler, Jonathan E. M. ;
DePristo, Mark A. ;
Lindsay, Sarah J. ;
Zhang, Yujun ;
Casals, Ferran ;
Idaghdour, Youssef ;
Hartl, Chris L. ;
Torroja, Carlos ;
Garimella, Kiran V. ;
Zilversmit, Martine ;
Cartwright, Reed ;
Rouleau, Guy A. ;
Daly, Mark ;
Stone, Eric A. ;
Hurles, Matthew E. ;
Awadalla, Philip .
NATURE GENETICS, 2011, 43 (07) :712-U137
[10]   Early postzygotic mutations contribute to de novo variation in a healthy monozygotic twin pair [J].
Dal, Guelsah M. ;
Erguner, Bekir ;
Sagiroglu, Mahmut S. ;
Yuksel, Bayram ;
Onat, Onur Emre ;
Alkan, Can ;
Ozcelik, Tayfun .
JOURNAL OF MEDICAL GENETICS, 2014, 51 (07) :455-459