Testing for Rare Variant Associations in the Presence of Missing Data

被引:17
作者
Auer, Paul L. [1 ]
Wang, Gao [2 ]
Leal, Suzanne M. [2 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Publ Hlth Sci Div, Seattle, WA 98104 USA
[2] Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
关键词
rare variant association studies; next-generation sequencing; complex disease;
D O I
10.1002/gepi.21736
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
For studies of genetically complex diseases, many association methods have been developed to analyze rare variants. When variant calls are missing, naive implementation of rare variant association (RVA) methods may lead to inflated type I error rates as well as a reduction in power. To overcome these problems, we developed extensions for four commonly used RVA tests. Data from the National Heart Lung and Blood Institute-Exome Sequencing Project were used to demonstrate that missing variant calls can lead to increased false-positive rates and that the extended RVA methods control type I error without reducing power. We suggest a combined strategy of data filtering based on variant and sample level missing genotypes along with implementation of these extended RVA tests.
引用
收藏
页码:529 / 538
页数:10
相关论文
共 16 条
  • [1] A map of human genome variation from population-scale sequencing
    Altshuler, David
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Collins, Francis S.
    De la Vega, Francisco M.
    Donnelly, Peter
    Egholm, Michael
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Knoppers, Bartha M.
    Lander, Eric S.
    Lehrach, Hans
    Mardis, Elaine R.
    McVean, Gil A.
    Nickerson, DebbieA.
    Peltonen, Leena
    Schafer, Alan J.
    Sherry, Stephen T.
    Wang, Jun
    Wilson, Richard K.
    Gibbs, Richard A.
    Deiros, David
    Metzker, Mike
    Muzny, Donna
    Reid, Jeff
    Wheeler, David
    Wang, Jun
    Li, Jingxiang
    Jian, Min
    Li, Guoqing
    Li, Ruiqiang
    Liang, Huiqing
    Tian, Geng
    Wang, Bo
    Wang, Jian
    Wang, Wei
    Yang, Huanming
    Zhang, Xiuqing
    Zheng, Huisong
    Lander, Eric S.
    Altshuler, David L.
    Ambrogio, Lauren
    Bloom, Toby
    Cibulskis, Kristian
    Fennell, Tim J.
    Gabriel, Stacey B.
    [J]. NATURE, 2010, 467 (7319) : 1061 - 1073
  • [2] Haplotype phasing: existing methods and new developments
    Browning, Sharon R.
    Browning, Brian L.
    [J]. NATURE REVIEWS GENETICS, 2011, 12 (10) : 703 - 714
  • [3] Exome sequencing and complex disease: practical aspects of rare variant association studies
    Do, Ron
    Kathiresan, Sekar
    Abecasis, Goncalo R.
    [J]. HUMAN MOLECULAR GENETICS, 2012, 21 : R1 - R9
  • [4] The European Nucleotide Archive
    Leinonen, Rasko
    Akhtar, Ruth
    Birney, Ewan
    Bower, Lawrence
    Cerdeno-Tarraga, Ana
    Cheng, Ying
    Cleland, Iain
    Faruque, Nadeem
    Goodgame, Neil
    Gibson, Richard
    Hoad, Gemma
    Jang, Mikyung
    Pakseresht, Nima
    Plaister, Sheila
    Radhakrishnan, Rajesh
    Reddy, Kethi
    Sobhany, Siamak
    Ten Hoopen, Petra
    Vaughan, Robert
    Zalunin, Vadim
    Cochrane, Guy
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D28 - D31
  • [5] Methods for detecting associations with rare variants for common diseases: Application to analysis of sequence data
    Li, Bingshan
    Leal, Suzanne M.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 83 (03) : 311 - 321
  • [6] Lin DY, 2011, AM J HUM GENET, V89, P354, DOI [10.1016/j.ajhg.2011.07.015, 10.1016/j.ajhg.2011.07.015.]
  • [7] LITTLE R. J., 2019, Statistical analysis with missing data, V793
  • [8] A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions
    Liu, Dajiang J.
    Leal, Suzanne M.
    [J]. PLOS GENETICS, 2010, 6 (10) : 1 - 14
  • [9] A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic
    Madsen, Bo Eskerod
    Browning, Sharon R.
    [J]. PLOS GENETICS, 2009, 5 (02):
  • [10] The NCBI dbGaP database of genotypes and phenotypes
    Mailman, Matthew D.
    Feolo, Michael
    Jin, Yumi
    Kimura, Masato
    Tryka, Kimberly
    Bagoutdinov, Rinat
    Hao, Luning
    Kiang, Anne
    Paschall, Justin
    Phan, Lon
    Popova, Natalia
    Pretel, Stephanie
    Ziyabari, Lora
    Lee, Moira
    Shao, Yu
    Wang, Zhen Y.
    Sirotkin, Karl
    Ward, Minghong
    Kholodov, Michael
    Zbicz, Kerry
    Beck, Jeffrey
    Kimelman, Michael
    Shevelev, Sergey
    Preuss, Don
    Yaschenko, Eugene
    Graeff, Alan
    Ostell, James
    Sherry, Stephen T.
    [J]. NATURE GENETICS, 2007, 39 (10) : 1181 - 1186