Performance assessment of copy number microarray platforms using a spike-in experiment

被引:15
作者
Halper-Stromberg, Eitan [1 ,2 ]
Frelin, Laurence [3 ]
Ruczinski, Ingo [1 ]
Scharpf, Robert [1 ]
Jie, Chunfa [4 ]
Carvalho, Benilton [5 ]
Hao, Haiping [4 ]
Hetrick, Kurt [6 ]
Jedlicka, Anne [7 ]
Dziedzic, Amanda [7 ]
Doheny, Kim [6 ]
Scott, Alan F. [6 ]
Baylin, Steve [8 ]
Pevsner, Jonathan [3 ,9 ]
Spencer, Forrest [10 ]
Irizarry, Rafael A. [1 ]
机构
[1] Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD USA
[2] Johns Hopkins Univ, Sch Med, Program Human Genet & Mol Biol, Baltimore, MD USA
[3] Hugo W Moser Res Inst Kennedy Krieger, Dept Neurol, Baltimore, MD USA
[4] Johns Hopkins Univ, Sch Med, High Throughput Biol Ctr, JHMI Microarray Core, Baltimore, MD USA
[5] Univ Cambridge, CRUK Cambridge Res Inst, Li Ka Shing Ctr, Dept Oncol, Cambridge CB2 0RE, England
[6] Johns Hopkins Univ, Ctr Inherited Dis Res, Baltimore, MD USA
[7] Johns Hopkins Univ, Dept Mol Microbiol & Immunol, Johns Hopkins Malaria Res Inst, Johns Hopkins Bloomberg Sch Publ Hlth, Baltimore, MD USA
[8] Johns Hopkins Univ, Sch Med, Dept Oncol, Baltimore, MD 21205 USA
[9] Johns Hopkins Univ, Sch Med, Dept Neurosci, Baltimore, MD 21205 USA
[10] Johns Hopkins Univ, Sch Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21205 USA
基金
美国国家卫生研究院;
关键词
HIDDEN-MARKOV MODEL; NORMALIZATION; ALGORITHMS; DETECT;
D O I
10.1093/bioinformatics/btr106
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Changes in the copy number of chromosomal DNA segments [copy number variants (CNVs)] have been implicated in human variation, heritable diseases and cancers. Microarray-based platforms are the current established technology of choice for studies reporting these discoveries and constitute the benchmark against which emergent sequence-based approaches will be evaluated. Research that depends on CNV analysis is rapidly increasing, and systematic platform assessments that distinguish strengths and weaknesses are needed to guide informed choice. Results: We evaluated the sensitivity and specificity of six platforms, provided by four leading vendors, using a spike-in experiment. NimbleGen and Agilent platforms outperformed Illumina and Affymetrix in accuracy and precision of copy number dosage estimates. However, Illumina and Affymetrix algorithms that leverage single nucleotide polymorphism (SNP) information make up for this disadvantage and perform well at variant detection. Overall, the NimbleGen 2.1M platform outperformed others, but only with the use of an alternative data analysis pipeline to the one offered by the manufacturer.
引用
收藏
页码:1052 / 1060
页数:9
相关论文
共 36 条
[1]   Methods and strategies for analyzing copy number variation using DNA microarrays [J].
Carter, Nigel P. .
NATURE GENETICS, 2007, 39 (Suppl 7) :S16-S21
[2]   Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data [J].
Carvalho, Benilton ;
Bengtsson, Henrik ;
Speed, Terence P. ;
Irizarry, Rafael A. .
BIOSTATISTICS, 2007, 8 (02) :485-499
[3]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[4]   QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data [J].
Colella, Stefano ;
Yau, Christopher ;
Taylor, Jennifer M. ;
Mirza, Ghazala ;
Butler, Helen ;
Clouston, Penny ;
Bassett, Anne S. ;
Seller, Anneke ;
Holmes, Christopher C. ;
Ragoussis, Jiannis .
NUCLEIC ACIDS RESEARCH, 2007, 35 (06) :2013-2025
[5]   Origins and functional impact of copy number variation in the human genome [J].
Conrad, Donald F. ;
Pinto, Dalila ;
Redon, Richard ;
Feuk, Lars ;
Gokcumen, Omer ;
Zhang, Yujun ;
Aerts, Jan ;
Andrews, T. Daniel ;
Barnes, Chris ;
Campbell, Peter ;
Fitzgerald, Tomas ;
Hu, Min ;
Ihm, Chun Hwa ;
Kristiansson, Kati ;
MacArthur, Daniel G. ;
MacDonald, Jeffrey R. ;
Onyiah, Ifejinelo ;
Pang, Andy Wing Chun ;
Robson, Sam ;
Stirrups, Kathy ;
Valsesia, Armand ;
Walter, Klaudia ;
Wei, John ;
Tyler-Smith, Chris ;
Carter, Nigel P. ;
Lee, Charles ;
Scherer, Stephen W. ;
Hurles, Matthew E. .
NATURE, 2010, 464 (7289) :704-712
[6]   Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays [J].
Di, XJ ;
Matsuzaki, H ;
Webster, TA ;
Hubbell, E ;
Liu, GY ;
Dong, SL ;
Bartell, D ;
Huang, J ;
Chiles, R ;
Yang, G ;
Shen, MM ;
Kulp, D ;
Kennedy, GC ;
Mei, R ;
Jones, KW ;
Cawley, S .
BIOINFORMATICS, 2005, 21 (09) :1958-1963
[7]   Adjustment of genomic waves in signal intensities from whole-genome SNP genotyping platforms [J].
Diskin, Sharon J. ;
Li, Mingyao ;
Hou, Cuiping ;
Yang, Shuzhang ;
Glessner, Joseph ;
Hakonarson, Hakon ;
Bucan, Maja ;
Maris, John M. ;
Wang, Kai .
NUCLEIC ACIDS RESEARCH, 2008, 36 (19)
[8]   Statistical issues in the analysis of Illumina data [J].
Dunning, Mark J. ;
Barbosa-Morais, Nuno L. ;
Lynch, Andy G. ;
Tavare, Simon ;
Ritchie, Matthew E. .
BMC BIOINFORMATICS, 2008, 9 (1)
[9]   Characterizing natural variation using next-generation sequencing technologies [J].
Gilad, Yoav ;
Pritchard, Jonathan K. ;
Thornton, Kevin .
TRENDS IN GENETICS, 2009, 25 (10) :463-471
[10]   PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data [J].
Greenman, Chris D. ;
Bignell, Graham ;
Butler, Adam ;
Edkins, Sarah ;
Hinton, Jon ;
Beare, Dave ;
Swamy, Sajani ;
Santarius, Thomas ;
Chen, Lina ;
Widaa, Sara ;
Futreal, P. Andy ;
Stratton, Michael R. .
BIOSTATISTICS, 2010, 11 (01) :164-175