Joint Identification of Multiple Genetic Variants via Elastic-Net Variable Selection in a Genome-Wide Association Analysis

被引:73
|
作者
Cho, Seoae
Kim, Kyunga [2 ]
Kim, Young Jin
Lee, Jong-Keuk [3 ]
Cho, Yoon Shin
Lee, Jong-Young
Han, Bok-Ghee
Kim, Heebal [4 ]
Ott, Jurg [5 ]
Park, Taesung [1 ,6 ]
机构
[1] Seoul Natl Univ, Dept Stat, Interdisciplinary Program Bioinformat, Seoul 151747, South Korea
[2] Sookmyung Womens Univ, Dept Stat, Seoul 140742, South Korea
[3] Univ Ulsan, Coll Med, Asan Inst Life Sci, Ulsan 138736, South Korea
[4] Seoul Natl Univ, Dept Agr Biotechnol, Seoul 151921, South Korea
[5] Beijing Inst Genom, Beijing 100029, Peoples R China
[6] Seoul Natl Univ, Dept Stat, Seoul 151747, South Korea
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
Genome-wide association; multiple regression; elastic-net variable selection; empirical replication; adult height; IGF-I GENE; SEQUENCE VARIANTS; ADULT HEIGHT; LOCI; POLYMORPHISMS; LASSO; REGRESSION; RISK; REGULARIZATION;
D O I
10.1111/j.1469-1809.2010.00597.x
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
P>Unraveling the genetic background of common complex traits is a major goal in modern genetics. In recent years, genome-wide association (GWA) studies have been conducted with large-scale data sets of genetic variants. Most of those studies have relied on single-marker approaches that identify single genetic factors individually and can be limited in considering fully the joint effects of multiple genetic factors on complex traits. Joint identification of multiple genetic factors would be more powerful and would provide better prediction on complex traits since it utilizes combined information across variants. Here we propose a multi-stage approach for GWA analysis: (1) prescreening, (2) joint identification of putative SNPs based on elastic-net variable selection, and (3) empirical replication using bootstrap samples. Our approach enables an efficient joint search for genetic associations in GWA analysis. The suggested empirical replication method can be beneficial in GWA studies because one can avoid a costly, independent replication study while eliminating false-positive associations and focusing on a smaller number of replicable variants. We applied the proposed approach to a GWA analysis, and jointly identified 129 genetic variants having an association with adult height in a Korean population.
引用
收藏
页码:416 / 428
页数:13
相关论文
共 50 条
  • [1] A variable selection method for genome-wide association studies
    He, Qianchuan
    Lin, Dan-Yu
    BIOINFORMATICS, 2011, 27 (01) : 1 - 8
  • [2] Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection
    Lu, Zhao-Hua
    Zhu, Hongtu
    Knickmeyer, Rebecca C.
    Sullivan, Patrick F.
    Williams, Stephanie N.
    Zou, Fei
    GENETIC EPIDEMIOLOGY, 2015, 39 (08) : 664 - 677
  • [3] Identification of genetic variants associated with diabetic kidney disease in multiple Korean cohorts via a genome-wide association study mega-analysis
    Jin, Heejin
    Kim, Ye An
    Lee, Young
    Kwon, Seung-hyun
    Do, Ah Ra
    Seo, Sujin
    Won, Sungho
    Seo, Je Hyun
    BMC MEDICINE, 2023, 21 (01)
  • [4] Nonnegative estimation and variable selection via adaptive elastic-net for high-dimensional data
    Li, Ning
    Yang, Hu
    Yang, Jing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (12) : 4263 - 4279
  • [5] Structured Genome-Wide Association Studies with Bayesian Hierarchical Variable Selection
    Zhao, Yize
    Zhu, Hongtu
    Lu, Zhaohua
    Knickmeyer, Rebecca C.
    Zou, Fei
    GENETICS, 2019, 212 (02) : 397 - 415
  • [6] Finding associated variants in genome-wide association studies on multiple traits
    Gai, Lisa
    Eskin, Eleazar
    BIOINFORMATICS, 2018, 34 (13) : 467 - 474
  • [7] Integrated analysis of genome-wide genetic and epigenetic association data for identification of disease mechanisms
    Ke, Xiayi
    Cortina-Borja, Mario
    Silva, Bruno Cesar
    Lowe, Robert
    Rakyan, Vardhman
    Balding, David
    EPIGENETICS, 2013, 8 (11) : 1236 - 1244
  • [8] Genetic Variants, Cardiovascular Risk and Genome-Wide Association Studies
    Companioni, Osmel
    Rodriguez Esparragon, Francisco
    Medina Fernandez-Aceituno, Alfonso
    Rodriguez Perez, Jose Carlos
    REVISTA ESPANOLA DE CARDIOLOGIA, 2011, 64 (06): : 509 - 514
  • [9] Genome-wide pathway analysis of a genome-wide association study on multiple sclerosis
    Song, Gwan Gyu
    Choi, Sung Jae
    Ji, Jong Dae
    Lee, Young Ho
    MOLECULAR BIOLOGY REPORTS, 2013, 40 (03) : 2557 - 2564
  • [10] A Genome-Wide Association Study of Genetic Variants of Apolipoprotein A1 Levels and Their Association with Vitamin D in Korean Cohorts
    Lee, Young
    Yoon, Ji Won
    Kim, Ye An
    Choi, Hyuk Jin
    Yoon, Byung Woo
    Seo, Je Hyun
    GENES, 2022, 13 (09)