CONTRA: copy number analysis for targeted resequencing

被引:271
作者
Li, Jason [1 ]
Lupat, Richard [1 ,2 ]
Amarasinghe, Kaushalya C. [3 ]
Thompson, Ella R. [2 ]
Doyle, Maria A. [1 ]
Ryland, Georgina L. [2 ]
Tothill, Richard W. [4 ]
Halgamuge, Saman K. [3 ]
Campbell, Ian G. [2 ,5 ,6 ]
Gorringe, Kylie L. [2 ,5 ,6 ]
机构
[1] Peter MacCallum Canc Ctr, Bioinformat Core Facil, Melbourne, Vic 3002, Australia
[2] Peter MacCallum Canc Ctr, Victorian Breast Canc Res Consortium, Canc Genet Lab, Melbourne, Vic 3002, Australia
[3] Univ Melbourne, Dept Mech Engn, Parkville, Vic 3010, Australia
[4] Peter MacCallum Canc Ctr, Mol Genom Core Facil, Melbourne, Vic 3002, Australia
[5] Univ Melbourne, Sir Peter MacCallum Dept Oncol, Parkville, Vic 3010, Australia
[6] Univ Melbourne, Dept Pathol, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
IDENTIFICATION; CANCER; FRAMEWORK; ACCURATE; CAPTURE; GENE;
D O I
10.1093/bioinformatics/bts146
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Results: We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes.
引用
收藏
页码:1307 / 1313
页数:7
相关论文
共 20 条
[1]   CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing [J].
Abyzov, Alexej ;
Urban, Alexander E. ;
Snyder, Michael ;
Gerstein, Mark .
GENOME RESEARCH, 2011, 21 (06) :974-984
[2]   Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries [J].
Aird, Daniel ;
Ross, Michael G. ;
Chen, Wei-Sheng ;
Danielsson, Maxwell ;
Fennell, Timothy ;
Russ, Carsten ;
Jaffe, David B. ;
Nusbaum, Chad ;
Gnirke, Andreas .
GENOME BIOLOGY, 2011, 12 (02)
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing [J].
Campbell, Peter J. ;
Stephens, Philip J. ;
Pleasance, Erin D. ;
O'Meara, Sarah ;
Li, Heng ;
Santarius, Thomas ;
Stebbings, Lucy A. ;
Leroy, Catherine ;
Edkins, Sarah ;
Hardy, Claire ;
Teague, Jon W. ;
Menzies, Andrew ;
Goodhead, Ian ;
Turner, Daniel J. ;
Clee, Christopher M. ;
Quail, Michael A. ;
Cox, Antony ;
Brown, Clive ;
Durbin, Richard ;
Hurles, Matthew E. ;
Edwards, Paul A. W. ;
Bignell, Graham R. ;
Stratton, Michael R. ;
Futreal, P. Andrew .
NATURE GENETICS, 2008, 40 (06) :722-729
[5]   High-resolution mapping of copy-number alterations with massively parallel sequencing [J].
Chiang, Derek Y. ;
Getz, Gad ;
Jaffe, David B. ;
O'Kelly, Michael J. T. ;
Zhao, Xiaojun ;
Carter, Scott L. ;
Russ, Carsten ;
Nusbaum, Chad ;
Meyerson, Matthew ;
Lander, Eric S. .
NATURE METHODS, 2009, 6 (01) :99-103
[6]   CNAseg-a novel framework for identification of copy number changes in cancer from second-generation sequencing data [J].
Ivakhno, Sergii ;
Royce, Tom ;
Cox, Anthony J. ;
Evers, Dirk J. ;
Cheetham, R. Keira ;
Tavare, Simon .
BIOINFORMATICS, 2010, 26 (24) :3051-3058
[7]   Massively Parallel Sequencing of Exons on the X Chromosome Identifies RBM10 as the Gene that Causes a Syndromic Form of Cleft Palate [J].
Johnston, Jennifer J. ;
Teer, Jamie K. ;
Cherukuri, Praveen E. ;
Hansen, Nancy F. ;
Loftus, Stacie K. ;
Chong, Karen ;
Mullikin, James C. ;
Biesecker, Leslie G. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (05) :743-748
[8]   Deep and wide digging for binding motifs in ChIP-Seq data [J].
Kulakovskiy, I. V. ;
Boeva, V. A. ;
Favorov, A. V. ;
Makeev, V. J. .
BIOINFORMATICS, 2010, 26 (20) :2622-2623
[9]  
Li H, 2009, BIOINFORMATICS, V25, P1094, DOI [10.1093/bioinformatics/btp324, 10.1093/bioinformatics/btp100]
[10]   The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data [J].
McKenna, Aaron ;
Hanna, Matthew ;
Banks, Eric ;
Sivachenko, Andrey ;
Cibulskis, Kristian ;
Kernytsky, Andrew ;
Garimella, Kiran ;
Altshuler, David ;
Gabriel, Stacey ;
Daly, Mark ;
DePristo, Mark A. .
GENOME RESEARCH, 2010, 20 (09) :1297-1303