Systematic comparison of variant calling pipelines of target genome sequencing cross multiple next-generation sequencers

被引:5
作者
Feng, Baosheng [1 ]
Lai, Juan [1 ]
Fan, Xue [2 ]
Liu, Yongfeng [1 ]
Wang, Miao [1 ]
Wu, Ping [1 ]
Zhou, Zhiliang [1 ]
Yan, Qin [1 ]
Sun, Lei [1 ]
机构
[1] GeneMind Biosci Co Ltd, Shenzhen, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Gen Hosp, Clin Res Inst, Sch Med, Shanghai, Peoples R China
基金
英国科研创新办公室;
关键词
HD832; TSO500; NGS; target genome sequencing; SNVer; Varscan; 2; MUTATION;
D O I
10.3389/fgene.2023.1293974
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Targeted genomic sequencing (TS) greatly benefits precision oncology by rapidly detecting genetic variations with better accuracy and sensitivity owing to its high sequencing depth. Multiple sequencing platforms and variant calling tools are available for TS, making it excruciating for researchers to choose. Therefore, benchmarking study across different platforms and pipelines available for TS is imperative. In this study, we performed a TS of Reference OncoSpan FFPE (HD832) sample enriched by TSO500 panel using four commercially available sequencers, and analyzed the output 50 datasets using five commonly-used bioinformatics pipelines. We systematically investigated the sequencing quality and variant detection sensitivity, expecting to provide optimal recommendations for future research. Four sequencing platforms returned highly concordant results in terms of base quality (Q20 > 94%), sequencing coverage (>97%) and depth (>2000x). Benchmarking revealed good concordance of variant calling across different platforms and pipelines, among which, FASTASeq 300 platform showed the highest sensitivity (100%) and precision (100%) in high-confidence variants calling when analyzed by SNVer and VarScan 2 algorithms. Furthermore, this sequencer demonstrated the shortest sequencing time (similar to 21 h) at the sequencing mode PE150. Through the intersection of 50 datasets generated in this study, we recommended a novel set of variant genes outside the truth set published by HD832, expecting to replenish HD832 for future research on tumor variant diagnosis. Besides, we applied these five tools to another panel (TargetSeq One) for Twist cfDNA Pan-cancer Reference Standard, comprehensive consideration of SNP and InDel sensitivity, SNVer and VarScan 2 performed best among them. Furthermore, SNVer and VarScan 2 also performed best for six cancer cell lines samples regarding SNP and InDel sensitivity. Considering the dissimilarity of variant calls across different pipelines for datasets from the same platform, we recommended an integration of multiple tools to improve variant calling sensitivity and accuracy for the cancer genome. Illumina and GeneMind technologies can be used independently or together by public health laboratories performing tumor TS. SNVer and VarScan 2 perform better regarding variant detection sensitivity for three typical tumor samples. Our study provides a standardized target sequencing resource to benchmark new bioinformatics protocols and sequencing platforms.
引用
收藏
页数:8
相关论文
共 25 条
[1]  
Benjamin D., 2019, BIORXIV, DOI [10.1101/861054, DOI 10.1101/861054]
[2]   Applications and analysis of targeted genomic sequencing in cancer studies [J].
Bewicke-Copley, Findlay ;
Kumar, Emil Arjun ;
Palladino, Giuseppe ;
Korfi, Koorosh ;
Wang, Jun .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2019, 17 :1348-1359
[3]  
Cherry PD, 2022, CANCER RES, V82
[4]   A scalable high-throughput targeted next-generation sequencing assay for comprehensive genomic profiling of solid tumors [J].
Conroy, Jeffrey M. ;
Pabla, Sarabjot ;
Glenn, Sean T. ;
Seager, R. J. ;
Van Roey, Erik ;
Gao, Shuang ;
Burgher, Blake ;
Andreas, Jonathan ;
Giamo, Vincent ;
Mallon, Melissa ;
Lee, Yong Hee ;
DePietro, Paul ;
Nesline, Mary ;
Wang, Yirong ;
Lenzo, Felicia L. ;
Klein, Roger ;
Zhang, Shengle .
PLOS ONE, 2021, 16 (12)
[5]  
Cortini F., 2017, EXPLOR RES HYPOTHESI, V2, P57, DOI [10.14218/erhm.2017.00010, DOI 10.14218/ERHM.2017.00010]
[6]   Best practices for variant calling in clinical sequencing [J].
Koboldt, Daniel C. .
GENOME MEDICINE, 2020, 12 (01)
[7]   VarScan 2: Somatic mutation and copy number alteration discovery in cancer by exome sequencing [J].
Koboldt, Daniel C. ;
Zhang, Qunyuan ;
Larson, David E. ;
Shen, Dong ;
McLellan, Michael D. ;
Lin, Ling ;
Miller, Christopher A. ;
Mardis, Elaine R. ;
Ding, Li ;
Wilson, Richard K. .
GENOME RESEARCH, 2012, 22 (03) :568-576
[8]   SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA [J].
Kockan, Can ;
Hach, Faraz ;
Sarrafi, Iman ;
Bell, Robert H. ;
McConeghy, Brian ;
Beja, Kevin ;
Haegert, Anne ;
Wyatt, Alexander W. ;
Volik, Stanislav V. ;
Chi, Kim N. ;
Collins, Colin C. ;
Sahinalp, Cenk .
BIOINFORMATICS, 2017, 33 (01) :26-34
[9]   The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data [J].
McKenna, Aaron ;
Hanna, Matthew ;
Banks, Eric ;
Sivachenko, Andrey ;
Cibulskis, Kristian ;
Kernytsky, Andrew ;
Garimella, Kiran ;
Altshuler, David ;
Gabriel, Stacey ;
Daly, Mark ;
DePristo, Mark A. .
GENOME RESEARCH, 2010, 20 (09) :1297-1303
[10]   Establishing guidelines to harmonize tumor mutational burden (TMB): in silico assessment of variation in TMB quantification across diagnostic platforms: phase I of the Friends of Cancer Research TMB Harmonization Project [J].
Merino, Diana M. ;
Mcshane, Lisa M. ;
Fabrizio, David ;
Funari, Vincent ;
Chen, Shu-Jen ;
White, James R. ;
Wenz, Paul ;
Baden, Jonathan ;
Barrett, J. Carl ;
Chaudhary, Ruchi ;
Chen, Li ;
Chen, Wangjuh ;
Cheng, Jen-Hao ;
Cyanam, Dinesh ;
Dickey, Jennifer S. ;
Gupta, Vikas ;
Hellmann, Matthew ;
Helman, Elena ;
Li, Yali ;
Maas, Joerg ;
Papin, Arnaud ;
Patidar, Rajesh ;
Quinn, Katie J. ;
Rizvi, Naiyer ;
Tae, Hongseok ;
Ward, Christine ;
Xie, Mingchao ;
Zehir, Ahmet ;
Zhao, Chen ;
Dietel, Manfred ;
Stenzinger, Albrecht ;
Stewart, Mark ;
Allen, Jeff .
JOURNAL FOR IMMUNOTHERAPY OF CANCER, 2020, 8 (01)