In-depth comparison of somatic point mutation callers based on different tumor next-generation sequencing depth data

被引:77
|
作者
Cai, Lei [1 ,2 ]
Yuan, Wei [1 ]
Zhang, Zhou [1 ,3 ]
He, Lin [1 ,4 ]
Chou, Kuo-Chen [2 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Psychot Disorders 13dz2260500, Key Lab Genet Dev & Neuropsychiat Disorders, Bio X Inst,Minist Educ, Shanghai 200030, Peoples R China
[2] Gordon Life Sci Inst, Boston, MA 02478 USA
[3] Shanghai Jiao Tong Univ, Sch Med, Inst Biliary Tract Dis, Xinhua Hosp, Shanghai 200092, Peoples R China
[4] Zhejiang Univ, Sch Med, Womens Hosp, Hangzhou 310006, Zhejiang, Peoples R China
[5] King Abdulaziz Univ, CEGMR, Jeddah 21589, Saudi Arabia
来源
SCIENTIFIC REPORTS | 2016年 / 6卷
关键词
CANCER GENOMES; SNV DETECTION; WHOLE-EXOME; WEB SERVER; IDENTIFICATION; VARIANTS; MODES; DISCOVERY; PACKAGE; PSEKNC;
D O I
10.1038/srep36540
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Four popular somatic single nucleotide variant (SNV) calling methods (Varscan, SomaticSniper, Strelka and MuTect2) were carefully evaluated on the real whole exome sequencing (WES, depth of -50X) and ultra-deep targeted sequencing (UDT-Seq, depth of similar to 370X) data. The four tools returned poor consensus on candidates (only 20% of calls were with multiple hits by the callers). For both WES and UDT-Seq, MuTect2 and Strelka obtained the largest proportion of COSMIC entries as well as the lowest rate of dbSNP presence and high-alternative-alleles-in-control calls, demonstrating their superior sensitivity and accuracy. Combining different callers does increase reliability of candidates, but narrows the list down to very limited range of tumor read depth and variant allele frequency. Calling SNV on UDT-Seq data, which were of much higher read-depth, discovered additional true-positive variations, despite an even more tremendous growth in false positive predictions. Our findings not only provide valuable benchmark for state-of-the-art SNV calling methods, but also shed light on the access to more accurate SNV identification in the future.
引用
收藏
页数:9
相关论文
共 47 条
  • [21] Integrated analysis of the miRNA mRNA next-generation sequencing data for finding their associations in different cancer types
    Bhowmick, Shib Sankar
    Bhattacharjee, Debotosh
    Rato, Luis
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 84 (84)
  • [22] Investigation of next-generation sequencing data of Klebsiella pneumoniae using web-based tools
    Brhelova, Eva
    Antonova, Mariya
    Pardy, Filip
    Kocmanova, Iva
    Mayer, Jiri
    Racil, Zdenek
    Lengerova, Martina
    JOURNAL OF MEDICAL MICROBIOLOGY, 2017, 66 (11) : 1673 - 1683
  • [23] Detection of copy number variations based on a local distance using next-generation sequencing data
    Liu, Guojun
    Yang, Hongzhi
    He, Zongzhen
    FRONTIERS IN GENETICS, 2023, 14
  • [24] Spinal solitary fibrous tumor of the neck: Next-generation sequencing-based analysis of genomic aberrations
    Ando, Mizuo
    Kobayashi, Hiroshi
    Shinozaki-Ushiku, Aya
    Chikuda, Hirotaka
    Matsubayashi, Yoshitaka
    Yoshida, Masafumi
    Saito, Yuki
    Kohsaka, Shinji
    Oda, Katsutoshi
    Miyagawa, Kiyoshi
    Aburatani, Hiroyuki
    Mano, Hiroyuki
    Yamasoba, Tatsuya
    AURIS NASUS LARYNX, 2020, 47 (06) : 1058 - 1063
  • [25] The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies
    Stenson, Peter D.
    Mort, Matthew
    Ball, Edward V.
    Evans, Katy
    Hayden, Matthew
    Heywood, Sally
    Hussain, Michelle
    Phillips, Andrew D.
    Cooper, David N.
    HUMAN GENETICS, 2017, 136 (06) : 665 - 677
  • [26] RKDOSCNV: A Local Kernel Density-Based Approach to the Detection of Copy Number Variations by Using Next-Generation Sequencing Data
    Liu, Guojun
    Zhang, Junying
    Yuan, Xiguo
    Wei, Chao
    FRONTIERS IN GENETICS, 2020, 11
  • [27] Comparison of Next-Generation Sequencing and Clone-Based Sequencing in Analysis of Hepatitis B Virus Reverse Transcriptase Quasispecies Heterogeneity
    Gong, Ling
    Han, Yue
    Chen, Li
    Liu, Feng
    Hao, Pei
    Sheng, Jia
    Li, Xin-Hua
    Yu, De-Min
    Gong, Qi-Ming
    Tian, Fei
    Guo, Xiao-kui
    Zhang, Xin-Xin
    JOURNAL OF CLINICAL MICROBIOLOGY, 2013, 51 (12) : 4087 - 4094
  • [28] BagGMM: Calling copy number variation by bagging multiple Gaussian mixture models from tumor and matched normal next-generation sequencing data
    Li, Yaoyao
    Zhang, Junying
    Yuan, Xiguo
    DIGITAL SIGNAL PROCESSING, 2019, 88 : 90 - 100
  • [29] Predictors for a dementia gene mutation based on gene-panel next-generation sequencing of a large dementia referral series
    Koriath, C.
    Kenny, J.
    Adamson, G.
    Druyeh, R.
    Taylor, W.
    Beck, J.
    Quinn, L.
    Mok, T. H.
    Dimitriadis, A.
    Norsworthy, P.
    Bass, N.
    Carter, J.
    Walker, Z.
    Kipps, C.
    Coulthard, E.
    Polke, J. M.
    Bernal-Quiros, M.
    Denning, N.
    Thomas, R.
    Raybould, R.
    Williams, J.
    Mummery, C. J.
    Wild, E. J.
    Houlden, H.
    Tabrizi, S. J.
    Rossor, M. N.
    Hummerich, H.
    Warren, J. D.
    Rowe, J. B.
    Rohrer, J. D.
    Schott, J. M.
    Fox, N. C.
    Collinge, J.
    Mead, S.
    MOLECULAR PSYCHIATRY, 2020, 25 (12) : 3399 - 3412
  • [30] A Density Peak-Based Method to Detect Copy Number Variations From Next-Generation Sequencing Data
    Xie, Kun
    Tian, Ye
    Yuan, Xiguo
    FRONTIERS IN GENETICS, 2021, 11