Stability Selection for Genome-Wide Association

被引:46
|
作者
Alexander, David H. [1 ]
Lange, Kenneth [2 ,3 ,4 ]
机构
[1] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
genome-wide association; variable selection; stability selection; the lasso; Wellcome Trust Case Control Consortium data; VARIABLE SELECTION; LASSO; LOCI; RISK;
D O I
10.1002/gepi.20623
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
This article applies the recently proposed "stability selection'' procedure of Meinshausen and Buhlmann to the problem of variable selection in genome-wide association. In particular, it explores whether stability selection can identify new regions of interest originally missed or can call into legitimate question regions originally flagged. Our analysis of the seven data sets of the Wellcome Trust Case-Control Consortium suggests that stability selection effectively controls the family-wise error rate but suffers a loss of power. The extensive correlation structure among SNP markers induced by linkage disequilibrium renders the procedure too conservative, causing it to miss regions known to be highly significant from simple marginal analyses. As a remedy one can aggregate nearby SNPs into groups and select groups rather than individual SNPs. The modified procedure can accurately identify the most important regions of genome-wide association, but in a simulation study it still offers less power than simpler and less computationally intensive methods of marginal association testing. Genet. Epidemiol. 35:722-728, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:722 / 728
页数:7
相关论文
共 50 条
  • [11] Statistical selection of biological models for genome-wide association analyses
    Bi, Wenjian
    Kang, Guolian
    Pounds, Stanley B.
    METHODS, 2018, 145 : 67 - 75
  • [12] Statistical Selection of Biological Models for Genome-Wide Association Analyses
    Bi, Wenjian
    Kang, Guolian
    Pounds, Stanley B.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 150 - 157
  • [13] Genome-wide selection and association in animal breeding using ssGBLUP
    Nedel Pertile, Simone Fernanda
    Fonseca e Silva, Fabyano
    Salvian, Mayara
    Mourao, Gerson Barreto
    PESQUISA AGROPECUARIA BRASILEIRA, 2016, 51 (10) : 1729 - 1736
  • [14] SNP Selection Strategies from Genome-Wide Association Studies
    Sinnwell, J. P.
    Schaid, D. J.
    GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 714 - 714
  • [15] Population Substructure and Control Selection in Genome-Wide Association Studies
    Yu, Kai
    Wang, Zhaoming
    Li, Qizhai
    Wacholder, Sholom
    Hunter, David J.
    Hoover, Robert N.
    Chanock, Stephen
    Thomas, Gilles
    PLOS ONE, 2008, 3 (07):
  • [16] Testing and genetic model selection in genome-wide association studies
    Loley, Christina
    Koenig, Inke R.
    Hothorn, Ludwig
    Ziegler, Andreas
    ANNALS OF HUMAN GENETICS, 2012, 76 : 420 - 420
  • [17] Genomic consequences of selection and genome-wide association mapping in soybean
    Zixiang Wen
    John F. Boyse
    Qijian Song
    Perry B. Cregan
    Dechun Wang
    BMC Genomics, 16
  • [18] Genomic consequences of selection and genome-wide association mapping in soybean
    Wen, Zixiang
    Boyse, John F.
    Song, Qijian
    Cregan, Perry B.
    Wang, Dechun
    BMC GENOMICS, 2015, 16
  • [19] Testing and Genetic Model Selection in Genome-Wide Association Studies
    Loley, Christina
    Konig, Inke R.
    Hothorn, Ludwig
    Ziegler, Andreas
    GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 149 - 149
  • [20] Sparse Coding for Feature Selection on Genome-Wide Association Data
    Braenne, Ingrid
    Labusch, Kai
    Mamlouk, Amir Madany
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 337 - 346