An evaluation of copy number variation detection tools for cancer using whole exome sequencing data

被引:108
作者
Zare, Fatima [1 ]
Dow, Michelle [2 ]
Monteleone, Nicholas [1 ]
Hosny, Abdelrahman [1 ]
Nabavi, Sheida [1 ,3 ]
机构
[1] Univ Connecticut, Comp Sci & Engn Dept, Storrs, CT 06269 USA
[2] Univ Calif San Diego, Biomed Informat Dept, San Diego, CA 92103 USA
[3] Univ Connecticut, Inst Syst Genom, Storrs, CT 06269 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
基金
美国国家卫生研究院;
关键词
Copy number variation; Whole-exome sequencing; Somatic aberrations; Cancer; STRUCTURAL VARIATION; SPECTRUM DISORDERS; HUMAN GENOME; DISEASE; DISCOVERY; VARIANTS; HEALTH;
D O I
10.1186/s12859-017-1705-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors' complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. Results: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (similar to 50% - similar to 80%), fair specificity (similar to 70% - similar to 94%) and poor FDRs (similar to 27% - similar to 60%). Also, using simulated data we observed that increasing the coverage more than 10x in exonic regions does not improve the detection power of the tools significantly. Conclusions: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users.
引用
收藏
页数:13
相关论文
共 53 条
[1]   Chromosome aberrations in solid tumors [J].
Albertson, DG ;
Collins, C ;
McCormick, F ;
Gray, JW .
NATURE GENETICS, 2003, 34 (04) :369-376
[2]   Inferring copy number and genotype in tumour exome data [J].
Amarasinghe, Kaushalya C. ;
Li, Jason ;
Hunter, Sally M. ;
Ryland, Georgina L. ;
Cowin, Prue A. ;
Campbell, Ian G. ;
Halgamuge, Saman K. .
BMC GENOMICS, 2014, 15
[3]   CoNVEX: copy number variation estimation in exome sequencing data using HMM [J].
Amarasinghe, Kaushalya C. ;
Li, Jason ;
Halgamuge, Saman K. .
BMC BIOINFORMATICS, 2013, 14
[4]  
[Anonymous], [No title captured]
[5]   CANOES: detecting rare copy number variants from whole exome sequencing data [J].
Backenroth, Daniel ;
Homsy, Jason ;
Murillo, Laura R. ;
Glessner, Joe ;
Lin, Edwin ;
Brueckner, Martina ;
Lifton, Richard ;
Goldmuntz, Elizabeth ;
Chung, Wendy K. ;
Shen, Yufeng .
NUCLEIC ACIDS RESEARCH, 2014, 42 (12) :e97
[6]  
Barnes MR, 2010, METHODS MOL BIOL, V628, P1, DOI 10.1007/978-1-60327-367-1_1
[7]   Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants [J].
Belkadi, Aziz ;
Bolze, Alexandre ;
Itan, Yuval ;
Cobat, Aurelie ;
Vincent, Quentin B. ;
Antipenko, Alexander ;
Shang, Lei ;
Boisson, Bertrand ;
Casanova, Jean-Laurent ;
Abel, Laurent .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (17) :5473-5478
[8]   cnvCapSeq: detecting copy number variation in long-range targeted resequencing data [J].
Bellos, Evangelos ;
Kumar, Vikrant ;
Lin, Clarabelle ;
Maggi, Jordi ;
Phua, Zai Yang ;
Cheng, Ching-Yu ;
Cheung, Chui Ming Gemmy ;
Hibberd, Martin L. ;
Wong, Tien Yin ;
Coin, Lachlan J. M. ;
Davila, Sonia .
NUCLEIC ACIDS RESEARCH, 2014, 42 (20) :e158
[9]   The landscape of somatic copy-number alteration across human cancers [J].
Beroukhim, Rameen ;
Mermel, Craig H. ;
Porter, Dale ;
Wei, Guo ;
Raychaudhuri, Soumya ;
Donovan, Jerry ;
Barretina, Jordi ;
Boehm, Jesse S. ;
Dobson, Jennifer ;
Urashima, Mitsuyoshi ;
Mc Henry, Kevin T. ;
Pinchback, Reid M. ;
Ligon, Azra H. ;
Cho, Yoon-Jae ;
Haery, Leila ;
Greulich, Heidi ;
Reich, Michael ;
Winckler, Wendy ;
Lawrence, Michael S. ;
Weir, Barbara A. ;
Tanaka, Kumiko E. ;
Chiang, Derek Y. ;
Bass, Adam J. ;
Loo, Alice ;
Hoffman, Carter ;
Prensner, John ;
Liefeld, Ted ;
Gao, Qing ;
Yecies, Derek ;
Signoretti, Sabina ;
Maher, Elizabeth ;
Kaye, Frederic J. ;
Sasaki, Hidefumi ;
Tepper, Joel E. ;
Fletcher, Jonathan A. ;
Tabernero, Josep ;
Baselga, Jose ;
Tsao, Ming-Sound ;
Demichelis, Francesca ;
Rubin, Mark A. ;
Janne, Pasi A. ;
Daly, Mark J. ;
Nucera, Carmelo ;
Levine, Ross L. ;
Ebert, Benjamin L. ;
Gabriel, Stacey ;
Rustgi, Anil K. ;
Antonescu, Cristina R. ;
Ladanyi, Marc ;
Letai, Anthony .
NATURE, 2010, 463 (7283) :899-905
[10]   Alzheimer risk associated with a copy number variation in the complement receptor 1 increasing C3b/C4b binding sites [J].
Brouwers, N. ;
Van Cauwenberghe, C. ;
Engelborghs, S. ;
Lambert, J-C ;
Bettens, K. ;
Le Bastard, N. ;
Pasquier, F. ;
Montoya, A. Gil ;
Peeters, K. ;
Mattheijssens, M. ;
Vandenberghe, R. ;
De Deyn, P. P. ;
Cruts, M. ;
Amouyel, P. ;
Sleegers, K. ;
Van Broeckhoven, C. .
MOLECULAR PSYCHIATRY, 2012, 17 (02) :223-233