CONTRA: copy number analysis for targeted resequencing

被引:270
作者
Li, Jason [1 ]
Lupat, Richard [1 ,2 ]
Amarasinghe, Kaushalya C. [3 ]
Thompson, Ella R. [2 ]
Doyle, Maria A. [1 ]
Ryland, Georgina L. [2 ]
Tothill, Richard W. [4 ]
Halgamuge, Saman K. [3 ]
Campbell, Ian G. [2 ,5 ,6 ]
Gorringe, Kylie L. [2 ,5 ,6 ]
机构
[1] Peter MacCallum Canc Ctr, Bioinformat Core Facil, Melbourne, Vic 3002, Australia
[2] Peter MacCallum Canc Ctr, Victorian Breast Canc Res Consortium, Canc Genet Lab, Melbourne, Vic 3002, Australia
[3] Univ Melbourne, Dept Mech Engn, Parkville, Vic 3010, Australia
[4] Peter MacCallum Canc Ctr, Mol Genom Core Facil, Melbourne, Vic 3002, Australia
[5] Univ Melbourne, Sir Peter MacCallum Dept Oncol, Parkville, Vic 3010, Australia
[6] Univ Melbourne, Dept Pathol, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
IDENTIFICATION; CANCER; FRAMEWORK; ACCURATE; CAPTURE; GENE;
D O I
10.1093/bioinformatics/bts146
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Results: We present a method for CNV detection for TR data, including whole-exome capture data. Our method calls copy number gains and losses for each target region based on normalized depth of coverage. Our key strategies include the use of base-level log-ratios to remove GC-content bias, correction for an imbalanced library size effect on log-ratios, and the estimation of log-ratio variations via binning and interpolation. Our methods are made available via CONTRA (COpy Number Targeted Resequencing Analysis), a software package that takes standard alignment formats (BAM/SAM) and outputs in variant call format (VCF4.0), for easy integration with other next-generation sequencing analysis packages. We assessed our methods using samples from seven different target enrichment assays, and evaluated our results using simulated data and real germline data with known CNV genotypes.
引用
收藏
页码:1307 / 1313
页数:7
相关论文
共 20 条
  • [1] CNVnator: An approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing
    Abyzov, Alexej
    Urban, Alexander E.
    Snyder, Michael
    Gerstein, Mark
    [J]. GENOME RESEARCH, 2011, 21 (06) : 974 - 984
  • [2] Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries
    Aird, Daniel
    Ross, Michael G.
    Chen, Wei-Sheng
    Danielsson, Maxwell
    Fennell, Timothy
    Russ, Carsten
    Jaffe, David B.
    Nusbaum, Chad
    Gnirke, Andreas
    [J]. GENOME BIOLOGY, 2011, 12 (02)
  • [3] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [4] Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing
    Campbell, Peter J.
    Stephens, Philip J.
    Pleasance, Erin D.
    O'Meara, Sarah
    Li, Heng
    Santarius, Thomas
    Stebbings, Lucy A.
    Leroy, Catherine
    Edkins, Sarah
    Hardy, Claire
    Teague, Jon W.
    Menzies, Andrew
    Goodhead, Ian
    Turner, Daniel J.
    Clee, Christopher M.
    Quail, Michael A.
    Cox, Antony
    Brown, Clive
    Durbin, Richard
    Hurles, Matthew E.
    Edwards, Paul A. W.
    Bignell, Graham R.
    Stratton, Michael R.
    Futreal, P. Andrew
    [J]. NATURE GENETICS, 2008, 40 (06) : 722 - 729
  • [5] High-resolution mapping of copy-number alterations with massively parallel sequencing
    Chiang, Derek Y.
    Getz, Gad
    Jaffe, David B.
    O'Kelly, Michael J. T.
    Zhao, Xiaojun
    Carter, Scott L.
    Russ, Carsten
    Nusbaum, Chad
    Meyerson, Matthew
    Lander, Eric S.
    [J]. NATURE METHODS, 2009, 6 (01) : 99 - 103
  • [6] CNAseg-a novel framework for identification of copy number changes in cancer from second-generation sequencing data
    Ivakhno, Sergii
    Royce, Tom
    Cox, Anthony J.
    Evers, Dirk J.
    Cheetham, R. Keira
    Tavare, Simon
    [J]. BIOINFORMATICS, 2010, 26 (24) : 3051 - 3058
  • [7] Massively Parallel Sequencing of Exons on the X Chromosome Identifies RBM10 as the Gene that Causes a Syndromic Form of Cleft Palate
    Johnston, Jennifer J.
    Teer, Jamie K.
    Cherukuri, Praveen E.
    Hansen, Nancy F.
    Loftus, Stacie K.
    Chong, Karen
    Mullikin, James C.
    Biesecker, Leslie G.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (05) : 743 - 748
  • [8] Deep and wide digging for binding motifs in ChIP-Seq data
    Kulakovskiy, I. V.
    Boeva, V. A.
    Favorov, A. V.
    Makeev, V. J.
    [J]. BIOINFORMATICS, 2010, 26 (20) : 2622 - 2623
  • [9] Li H, 2009, BIOINFORMATICS, V25, P1094, DOI [10.1093/bioinformatics/btp324, 10.1093/bioinformatics/btp100]
  • [10] The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data
    McKenna, Aaron
    Hanna, Matthew
    Banks, Eric
    Sivachenko, Andrey
    Cibulskis, Kristian
    Kernytsky, Andrew
    Garimella, Kiran
    Altshuler, David
    Gabriel, Stacey
    Daly, Mark
    DePristo, Mark A.
    [J]. GENOME RESEARCH, 2010, 20 (09) : 1297 - 1303