Accurity: accurate tumor purity and ploidy inference from tumor-normal WGS data by jointly modelling somatic copy number alterations and heterozygous germline single-nucleotide-variants

被引:18
作者
Luo, Zhihui [1 ]
Fan, Xinping [1 ,2 ]
Su, Yao [1 ]
Huang, Yu S. [1 ,2 ]
机构
[1] Chinese Acad Sci, Shanghai Inst Mat Med, State Key Lab Drug Res, Shanghai 201203, Peoples R China
[2] Univ Chinese Acad Sci, 19A Yuquan Rd, Beijing 100049, Peoples R China
关键词
COMPUTATIONAL METHODS; CANCER; EVOLUTION; EXPRESSION; MUTATION; ENABLES; TISSUE; BIAS;
D O I
10.1093/bioinformatics/bty043
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Tumor purity and ploidy have a substantial impact on next-gen sequence analyses of tumor samples and may alter the biological and clinical interpretation of results. Despite the existence of several computational methods that are dedicated to estimate tumor purity and/or ploidy from The Cancer Genome Atlas (TCGA) tumor-normal whole-genome-sequencing (WGS) data, an accurate, fast and fully-automated method that works in a wide range of sequencing coverage, level of tumor purity and level of intra-tumor heterogeneity, is still missing. Results: We describe a computational method called Accurity that infers tumor purity, tumor cell ploidy and absolute allelic copy numbers for somatic copy number alterations (SCNAs) from tumor-normal WGS data by jointly modelling SCNAs and heterozygous germline singlenucleotide-variants (HGSNVs). Results from both in silico and real sequencing data demonstrated that Accurity is highly accurate and robust, even in low-purity, high-ploidy and low-coverage settings in which several existing methods perform poorly. Accounting for tumor purity and ploidy, Accurity significantly increased signal/noise gaps between different copy numbers. We are hopeful that Accurity is of clinical use for identifying cancer diagnostic biomarkers.
引用
收藏
页码:2004 / 2011
页数:8
相关论文
共 34 条
[1]   Comparative analysis of methods for identifying somatic copy number alterations from deep sequencing data [J].
Alkodsi, Amjad ;
Louhimo, Riku ;
Hautaniemi, Sampsa .
BRIEFINGS IN BIOINFORMATICS, 2015, 16 (02) :242-254
[2]   EXPANDS: expanding ploidy and allele frequency on nested subpopulations [J].
Andor, Noemi ;
Harness, Julie V. ;
Muller, Sabine ;
Mewes, Hans W. ;
Petritsch, Claudia .
BIOINFORMATICS, 2014, 30 (01) :50-60
[3]   Systematic pan-cancer analysis of tumour purity [J].
Aran, Dvir ;
Sirota, Marina ;
Butte, Atul J. .
NATURE COMMUNICATIONS, 2015, 6
[4]   Summarizing and correcting the GC content bias in high-throughput sequencing [J].
Benjamini, Yuval ;
Speed, Terence P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (10) :e72
[5]   The landscape of somatic copy-number alteration across human cancers [J].
Beroukhim, Rameen ;
Mermel, Craig H. ;
Porter, Dale ;
Wei, Guo ;
Raychaudhuri, Soumya ;
Donovan, Jerry ;
Barretina, Jordi ;
Boehm, Jesse S. ;
Dobson, Jennifer ;
Urashima, Mitsuyoshi ;
Mc Henry, Kevin T. ;
Pinchback, Reid M. ;
Ligon, Azra H. ;
Cho, Yoon-Jae ;
Haery, Leila ;
Greulich, Heidi ;
Reich, Michael ;
Winckler, Wendy ;
Lawrence, Michael S. ;
Weir, Barbara A. ;
Tanaka, Kumiko E. ;
Chiang, Derek Y. ;
Bass, Adam J. ;
Loo, Alice ;
Hoffman, Carter ;
Prensner, John ;
Liefeld, Ted ;
Gao, Qing ;
Yecies, Derek ;
Signoretti, Sabina ;
Maher, Elizabeth ;
Kaye, Frederic J. ;
Sasaki, Hidefumi ;
Tepper, Joel E. ;
Fletcher, Jonathan A. ;
Tabernero, Josep ;
Baselga, Jose ;
Tsao, Ming-Sound ;
Demichelis, Francesca ;
Rubin, Mark A. ;
Janne, Pasi A. ;
Daly, Mark J. ;
Nucera, Carmelo ;
Levine, Ross L. ;
Ebert, Benjamin L. ;
Gabriel, Stacey ;
Rustgi, Anil K. ;
Antonescu, Cristina R. ;
Ladanyi, Marc ;
Letai, Anthony .
NATURE, 2010, 463 (7283) :899-905
[6]   Linking oncogenic pathways with therapeutic opportunities [J].
Bild, Andrea H. ;
Potti, Anil ;
Nevins, Joseph R. .
NATURE REVIEWS CANCER, 2006, 6 (09) :735-U13
[7]   Multi-factor data normalization enables the detection of copy number aberrations in amplicon sequencing data [J].
Boeva, Valentina ;
Popova, Tatiana ;
Lienard, Maxime ;
Toffoli, Sebastien ;
Kamal, Maud ;
Le Tourneau, Christophe ;
Gentien, David ;
Servant, Nicolas ;
Gestraud, Pierre ;
Frio, Thomas Rio ;
Hupe, Philippe ;
Barillot, Emmanuel ;
Laes, Jean-Francois .
BIOINFORMATICS, 2014, 30 (24) :3443-3450
[8]   Absolute quantification of somatic DNA alterations in human cancer [J].
Carter, Scott L. ;
Cibulskis, Kristian ;
Helman, Elena ;
McKenna, Aaron ;
Shen, Hui ;
Zack, Travis ;
Laird, Peter W. ;
Onofrio, Robert C. ;
Winckler, Wendy ;
Weir, Barbara A. ;
Beroukhim, Rameen ;
Pellman, David ;
Levine, Douglas A. ;
Lander, Eric S. ;
Meyerson, Matthew ;
Getz, Gad .
NATURE BIOTECHNOLOGY, 2012, 30 (05) :413-+
[9]  
Cronin M, 2011, BIOMARK MED, V5, P293, DOI [10.2217/BMM.11.37, 10.2217/bmm.11.37]
[10]   Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data [J].
Degner, Jacob F. ;
Marioni, John C. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Nkadori, Everlyne ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
BIOINFORMATICS, 2009, 25 (24) :3207-3212