Systematic assessment of copy number variant detection via genome-wide SNP genotyping

被引:154
作者
Cooper, Gregory M. [1 ]
Zerr, Troy [1 ]
Kidd, Jeffrey M. [1 ]
Eichler, Evan E. [1 ,2 ]
Nickerson, Deborah A. [1 ]
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Howard Hughes Med Inst, Chevy Chase, MD USA
基金
美国国家科学基金会;
关键词
D O I
10.1038/ng.236
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
SNP genotyping has emerged as a technology to incorporate copy number variants (CNVs) into genetic analyses of human traits. However, the extent to which SNP platforms accurately capture CNVs remains unclear. Using independent, sequence-based CNV maps, we find that commonly used SNP platforms have limited or no probe coverage for a large fraction of CNVs. Despite this, in 9 samples we inferred 368 CNVs using Illumina SNP genotyping data and experimentally validated over two-thirds of these. We also developed a method (SNP-Conditional Mixture Modeling, SCIMM) to robustly genotype deletions using as few as two SNP probes. We find that HapMap SNPs are strongly correlated with 82% of common deletions, but the newest SNP platforms effectively tag about 50%. We conclude that currently available genome-wide SNP assays can capture CNVs accurately, but improvements in array designs, particularly in duplicated sequences, are necessary to facilitate more comprehensive analyses of genomic variation.
引用
收藏
页码:1199 / 1203
页数:5
相关论文
共 30 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[3]   QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data [J].
Colella, Stefano ;
Yau, Christopher ;
Taylor, Jennifer M. ;
Mirza, Ghazala ;
Butler, Helen ;
Clouston, Penny ;
Bassett, Anne S. ;
Seller, Anneke ;
Holmes, Christopher C. ;
Ragoussis, Jiannis .
NUCLEIC ACIDS RESEARCH, 2007, 35 (06) :2013-2025
[4]   A high-resolution survey of deletion polymorphism in the human genome [J].
Conrad, DF ;
Andrews, TD ;
Carter, NP ;
Hurles, ME ;
Pritchard, JK .
NATURE GENETICS, 2006, 38 (01) :75-81
[5]   Mutational and selective effects on copy-number variants in the human genome [J].
Cooper, Gregory M. ;
Nickerson, Deborah A. ;
Eichler, Evan E. .
NATURE GENETICS, 2007, 39 (Suppl 7) :S22-S29
[6]   Unsupervised segmentation of continuous genomic data [J].
Day, Nathan ;
Hemmaplardh, Andrew ;
Thurman, Robert E. ;
Stamatoyannopoulos, John A. ;
Noble, William S. .
BIOINFORMATICS, 2007, 23 (11) :1424-1426
[7]   Array CGH analysis of copy number variation identifies 1284 new genes variant in healthy white males: implications for association studies of complex diseases [J].
de Smith, Adam J. ;
Tsalenko, Anya ;
Sampas, Nick ;
Scheffer, Alicia ;
Yamada, N. Alice ;
Tsang, Peter ;
Ben-Dor, Amir ;
Yakhini, Zohar ;
Ellis, Richard J. ;
Bruhn, Laurakay ;
Laderman, Stephen ;
Froguel, Philippe ;
Blakemore, Alexandra I. F. .
HUMAN MOLECULAR GENETICS, 2007, 16 (23) :2783-2794
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Completing the map of human genetic variation [J].
Eichler, Evan E. ;
Nickerson, Deborah A. ;
Altshuler, David ;
Bowcock, Anne M. ;
Brooks, Lisa D. ;
Carter, Nigel P. ;
Church, Deanna M. ;
Felsenfeld, Adam ;
Guyer, Mark ;
Lee, Charles ;
Lupski, James R. ;
Mullikin, James C. ;
Pritchard, Jonathan K. ;
Sebat, Jonathan ;
Sherry, Stephen T. ;
Smith, Douglas ;
Valle, David ;
Waterston, Robert H. .
NATURE, 2007, 447 (7141) :161-165
[10]   Copy number variants and common disorders: Filling the gaps and exploring complexity in genome-wide association studies [J].
Estivill, Xavier ;
Armengol, Lluis .
PLOS GENETICS, 2007, 3 (10) :1787-1799