An evaluation of copy number variation detection tools for cancer using whole exome sequencing data

被引:107
|
作者
Zare, Fatima [1 ]
Dow, Michelle [2 ]
Monteleone, Nicholas [1 ]
Hosny, Abdelrahman [1 ]
Nabavi, Sheida [1 ,3 ]
机构
[1] Univ Connecticut, Comp Sci & Engn Dept, Storrs, CT 06269 USA
[2] Univ Calif San Diego, Biomed Informat Dept, San Diego, CA 92103 USA
[3] Univ Connecticut, Inst Syst Genom, Storrs, CT 06269 USA
来源
BMC BIOINFORMATICS | 2017年 / 18卷
基金
美国国家卫生研究院;
关键词
Copy number variation; Whole-exome sequencing; Somatic aberrations; Cancer; STRUCTURAL VARIATION; SPECTRUM DISORDERS; HUMAN GENOME; DISEASE; DISCOVERY; VARIANTS; HEALTH;
D O I
10.1186/s12859-017-1705-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors' complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. Results: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (similar to 50% - similar to 80%), fair specificity (similar to 70% - similar to 94%) and poor FDRs (similar to 27% - similar to 60%). Also, using simulated data we observed that increasing the coverage more than 10x in exonic regions does not improve the detection power of the tools significantly. Conclusions: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
    Fatima Zare
    Michelle Dow
    Nicholas Monteleone
    Abdelrahman Hosny
    Sheida Nabavi
    BMC Bioinformatics, 18
  • [2] An Evaluation of Copy Number Variation Detection Tools from Whole-Exome Sequencing Data
    Tan, Renjie
    Wang, Yadong
    Kleinstein, Sarah E.
    Liu, Yongzhuang
    Zhu, Xiaolin
    Guo, Hongzhe
    Jiang, Qinghua
    Allen, Andrew S.
    Zhu, Mingfu
    HUMAN MUTATION, 2014, 35 (07) : 899 - 907
  • [3] Evaluation of Copy Number Variation (CNV) detection methods in whole exome sequencing data
    Zhang, Peng
    Ling, Hua
    Pugh, Elizabeth
    Hetrick, Kurt
    Witmer, Dane
    Sobreira, Nara
    Valle, David
    Doheny, Kimberly
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 597 - 597
  • [4] A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data
    Gabrielaite, Migle
    Torp, Mathias Husted
    Rasmussen, Malthe Sebro
    Andreu-Sanchez, Sergio
    Vieira, Filipe Garrett
    Pedersen, Christina Bligaard
    Kinalis, Savvas
    Madsen, Majbritt Busk
    Kodama, Miyako
    Demircan, Guel Sude
    Simonyan, Arman
    Yde, Christina Westmose
    Olsen, Lars Ronn
    Marvig, Rasmus L.
    ostrup, Olga
    Rossing, Maria
    Nielsen, Finn Cilius
    Winther, Ole
    Bagger, Frederik Otzen
    CANCERS, 2021, 13 (24)
  • [5] Exome sequencing and whole genome sequencing for the detection of copy number variation
    Hehir-Kwa, Jayne Y.
    Pfundt, Rolph
    Veltman, Joris A.
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2015, 15 (08) : 1023 - 1032
  • [6] Evaluation of somatic copy number estimation tools for whole-exome sequencing data
    Nam, Jae-Yong
    Kim, Nayoung K. D.
    Kim, Sang Cheol
    Joung, Je-Gun
    Xi, Ruibin
    Lee, Semin
    Park, Peter J.
    Park, Woong-Yang
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (02) : 185 - 192
  • [7] Comparative study of whole exome sequencing-based copy number variation detection tools
    Lanling Zhao
    Han Liu
    Xiguo Yuan
    Kun Gao
    Junbo Duan
    BMC Bioinformatics, 21
  • [8] Comparative study of whole exome sequencing-based copy number variation detection tools
    Zhao, Lanling
    Liu, Han
    Yuan, Xiguo
    Gao, Kun
    Duan, Junbo
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [9] CODEX: a normalization and copy number variation detection method for whole exome sequencing
    Jiang, Yuchao
    Oldridge, Derek A.
    Diskin, Sharon J.
    Zhang, Nancy R.
    NUCLEIC ACIDS RESEARCH, 2015, 43 (06) : e39
  • [10] Copy Number Analysis of Whole Exome Sequencing Data
    Madubata, Chinwe
    Bi, Xin
    Pang, Jiuhong
    Gu, Yue
    Koganti, Lahari
    Liao, Jun
    Hsiao, Susan
    Aggarwal, Vimla
    Mansukhani, Mahesh
    Jobanputra, Vaidehi
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2024, 162 : S158 - S159