Replication Strategies for Rare Variant Complex Trait Association Studies via Next-Generation Sequencing

被引:45
作者
Liu, Dajiang J. [1 ,2 ]
Leal, Suzanne M. [1 ,2 ]
机构
[1] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
[2] Rice Univ, Dept Stat, Houston, TX 77005 USA
基金
美国国家卫生研究院;
关键词
GENOTYPING ERRORS; COMMON DISEASES; HUMAN GENOME; MUTATIONS; POPULATION; SUSCEPTIBILITY; PROBABILITY; CONTRIBUTE; DISCOVERY; SELECTION;
D O I
10.1016/j.ajhg.2010.10.025
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
There is solid evidence that complex traits can be caused by rare variants. Next-generation sequencing technologies are powerful tools for mapping rare variants. Confirmation of significant findings in stage 1 through replication in an independent stage 2 sample is necessary for association studies. For gene-based mapping of rare variants, two replication strategies are possible: (1) variant-based replication, wherein only variants from nucleotide sites uncovered in stage 1 are genotyped and followed-up and (2) sequence-based replication, wherein the gene region is sequenced in the replication sample and both known and novel variants are tested. The efficiency of the two strategies is dependent on the proportions of causative variants discovered in stage 1 and sequencing/genotyping errors. With rigorous population genetic and phenotypic models, it is demonstrated that sequence-based replication is consistently more powerful. However, the power gain is small (1) for large-scale studies with thousands of individuals, because a large fraction of causative variant sites can be observed and (2) for small- to medium-scale studies with a few hundred samples, because a large proportion of the locus population attributable risk can be explained by the uncovered variants. Therefore, genotyping can be a temporal solution for replicating genetic studies if stage 1 and 2 samples are drawn from the same population. However, sequence-based replication is advantageous if the stage 1 sample is small or novel variants discovery is also of interest. It is shown that currently attainable levels of sequencing error only minimally affect the comparison, and the advantage of sequence-based replication remains.
引用
收藏
页码:790 / 801
页数:12
相关论文
共 40 条
[1]  
[Anonymous], Online Mendelian Inheritance in Man
[2]   Accurate detection and genotyping of SNPs utilizing population sequencing data [J].
Bansal, Vikas ;
Harismendy, Olivier ;
Tewhey, Ryan ;
Murray, Sarah S. ;
Schork, Nicholas J. ;
Topol, Eric J. ;
Frazer, Kelly A. .
GENOME RESEARCH, 2010, 20 (04) :537-545
[3]   Whole-exome sequencing identifies recessive WDR62 mutations in severe brain malformations [J].
Bilguvar, Kaya ;
Ozturk, Ali Kemal ;
Louvi, Angeliki ;
Kwan, Kenneth Y. ;
Choi, Murim ;
Tatli, Burak ;
Yalnizoglu, Dilek ;
Tuysuz, Beyhan ;
Caglayan, Ahmet Okay ;
Gokben, Sarenur ;
Kaymakcalan, Hande ;
Barak, Tanyeri ;
Bakircioglu, Mehmet ;
Yasuno, Katsuhito ;
Ho, Winson ;
Sanders, Stephan ;
Zhu, Ying ;
Yilmaz, Sanem ;
Dincer, Alp ;
Johnson, Michele H. ;
Bronen, Richard A. ;
Kocer, Naci ;
Per, Hueseyin ;
Mane, Shrikant ;
Pamir, Mehmet Necmettin ;
Yalcinkaya, Cengiz ;
Kumandas, Sefer ;
Topcu, Meral ;
Ozmen, Meral ;
Sestan, Nenad ;
Lifton, Richard P. ;
State, Matthew W. ;
Gunel, Murat .
NATURE, 2010, 467 (7312) :207-U93
[4]   Common and rare variants in multifactorial susceptibility to common diseases [J].
Bodmer, Walter ;
Bonilla, Carolina .
NATURE GENETICS, 2008, 40 (06) :695-701
[5]   Assessing the evolutionary impact of amino acid mutations in the human genome [J].
Boyko, Adam R. ;
Williamson, Scott H. ;
Indap, Amit R. ;
Degenhardt, Jeremiah D. ;
Hernandez, Ryan D. ;
Lohmueller, Kirk E. ;
Adams, Mark D. ;
Schmidt, Steffen ;
Sninsky, John J. ;
Sunyaev, Shamil R. ;
White, Thomas J. ;
Nielsen, Rasmus ;
Clark, Andrew G. ;
Bustamante, Carlos D. .
PLOS GENETICS, 2008, 4 (05)
[6]   Prevalence of hepatic steatosis in an urban population in the United States: Impact of ethnicity [J].
Browning, JD ;
Szczepaniak, LS ;
Dobbins, R ;
Nuremberg, P ;
Horton, JD ;
Cohen, JC ;
Grundy, SM ;
Hobbs, HH .
HEPATOLOGY, 2004, 40 (06) :1387-1395
[7]   Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels [J].
Cohen, JC ;
Pertsemlidis, A ;
Fahmi, S ;
Esmail, S ;
Vega, GL ;
Grundy, SM ;
Hobbs, HH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (06) :1810-1815
[8]   Multiple rare Alleles contribute to low plasma levels of HDL cholesterol [J].
Cohen, JC ;
Kiss, RS ;
Pertsemlidis, A ;
Marcel, YL ;
McPherson, R ;
Hobbs, HH .
SCIENCE, 2004, 305 (5685) :869-872
[9]   HLA-B☆5701 genotype is a major determinant of drug-induced liver injury due to flucloxacillin [J].
Daly, Ann K. ;
Donaldson, Peter T. ;
Bhatnagar, Pallav ;
Shen, Yufeng ;
Pe'er, Itsik ;
Floratos, Aris ;
Daly, Mark J. ;
Goldstein, David B. ;
John, Sally ;
Nelson, Matthew R. ;
Graham, Julia ;
Park, B. Kevin ;
Dillon, John F. ;
Bernal, William ;
Cordell, Heather J. ;
Pirmohamed, Munir ;
Aithal, Guruprasad P. ;
Day, Christopher P. .
NATURE GENETICS, 2009, 41 (07) :816-U71
[10]   A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data [J].
Douglas, JA ;
Boehnke, M ;
Lange, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (04) :1287-1297