Joint Identification of Multiple Genetic Variants via Elastic-Net Variable Selection in a Genome-Wide Association Analysis

被引:73
作者
Cho, Seoae
Kim, Kyunga [2 ]
Kim, Young Jin
Lee, Jong-Keuk [3 ]
Cho, Yoon Shin
Lee, Jong-Young
Han, Bok-Ghee
Kim, Heebal [4 ]
Ott, Jurg [5 ]
Park, Taesung [1 ,6 ]
机构
[1] Seoul Natl Univ, Dept Stat, Interdisciplinary Program Bioinformat, Seoul 151747, South Korea
[2] Sookmyung Womens Univ, Dept Stat, Seoul 140742, South Korea
[3] Univ Ulsan, Coll Med, Asan Inst Life Sci, Ulsan 138736, South Korea
[4] Seoul Natl Univ, Dept Agr Biotechnol, Seoul 151921, South Korea
[5] Beijing Inst Genom, Beijing 100029, Peoples R China
[6] Seoul Natl Univ, Dept Stat, Seoul 151747, South Korea
基金
新加坡国家研究基金会; 美国国家科学基金会;
关键词
Genome-wide association; multiple regression; elastic-net variable selection; empirical replication; adult height; IGF-I GENE; SEQUENCE VARIANTS; ADULT HEIGHT; LOCI; POLYMORPHISMS; LASSO; REGRESSION; RISK; REGULARIZATION;
D O I
10.1111/j.1469-1809.2010.00597.x
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
P>Unraveling the genetic background of common complex traits is a major goal in modern genetics. In recent years, genome-wide association (GWA) studies have been conducted with large-scale data sets of genetic variants. Most of those studies have relied on single-marker approaches that identify single genetic factors individually and can be limited in considering fully the joint effects of multiple genetic factors on complex traits. Joint identification of multiple genetic factors would be more powerful and would provide better prediction on complex traits since it utilizes combined information across variants. Here we propose a multi-stage approach for GWA analysis: (1) prescreening, (2) joint identification of putative SNPs based on elastic-net variable selection, and (3) empirical replication using bootstrap samples. Our approach enables an efficient joint search for genetic associations in GWA analysis. The suggested empirical replication method can be beneficial in GWA studies because one can avoid a costly, independent replication study while eliminating false-positive associations and focusing on a smaller number of replicable variants. We applied the proposed approach to a GWA analysis, and jointly identified 129 genetic variants having an association with adult height in a Korean population.
引用
收藏
页码:416 / 428
页数:13
相关论文
共 51 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]   TheEGF receptor family: spearheading a merger of signaling and therapeutics [J].
Bublil, Erez M. ;
Yarden, Yosef .
CURRENT OPINION IN CELL BIOLOGY, 2007, 19 (02) :124-134
[3]   Structure of the extracellular region of HER3 reveals an interdomain tether [J].
Cho, HS ;
Leahy, DJ .
SCIENCE, 2002, 297 (5585) :1330-1333
[4]   A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits [J].
Cho, Yoon Shin ;
Go, Min Jin ;
Kim, Young Jin ;
Heo, Jee Yeon ;
Oh, Ji Hee ;
Ban, Hyo-Jeong ;
Yoon, Dankyu ;
Lee, Mi Hee ;
Kim, Dong-Joon ;
Park, Miey ;
Cha, Seung-Hun ;
Kim, Jun-Woo ;
Han, Bok-Ghee ;
Min, Haesook ;
Ahn, Younjhin ;
Park, Man Suk ;
Han, Hye Ree ;
Jang, Hye-Yoon ;
Cho, Eun Young ;
Lee, Jong-Eun ;
Cho, Nam H. ;
Shin, Chol ;
Park, Taesung ;
Park, Ji Wan ;
Lee, Jong-Keuk ;
Cardon, Lon ;
Clarke, Geraldine ;
McCarthy, Mark I. ;
Lee, Jong-Young ;
Lee, Jong-Koo ;
Oh, Bermseok ;
Kim, Hyung-Lae .
NATURE GENETICS, 2009, 41 (05) :527-534
[5]   Neuregulin 1-erbB signaling and the molecular/cellular basis of schizophrenia [J].
Corfas, G ;
Roy, K ;
Buxbaum, J .
NATURE NEUROSCIENCE, 2004, 7 (06) :575-580
[6]   CONFIDENCE-INTERVALS IN RIDGE-REGRESSION BY BOOTSTRAPPING THE DEPENDENT VARIABLE - A SIMULATION STUDY [J].
CRIVELLI, A ;
FIRINGUETTI, L ;
MONTANO, R ;
MUNOZ, M .
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1995, 24 (03) :631-652
[7]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[8]  
Friedman J., 2010, J. Stat. Software, V33
[9]   Common Genetic Variation and Human Traits [J].
Goldstein, David B. .
NEW ENGLAND JOURNAL OF MEDICINE, 2009, 360 (17) :1696-1698
[10]   Many sequence variants affecting diversity of adult human height [J].
Gudbjartsson, Daniel F. ;
Walters, G. Bragi ;
Thorleifsson, Gudmar ;
Stefansson, Hreinn ;
Halldorsson, Bjarni V. ;
Zusmanovich, Pasha ;
Sulem, Patrick ;
Thorlacius, Steinunn ;
Gylfason, Arnaldur ;
Steinberg, Stacy ;
Helgadottir, Anna ;
Ingason, Andres ;
Steinthorsdottir, Valgerdur ;
Olafsdottir, Elinborg J. ;
Olafsdottir, Gudridur H. ;
Jonsson, Thorvaldur ;
Borch-Johnsen, Knut ;
Hansen, Torben ;
Andersen, Gitte ;
Jorgensen, Torben ;
Pedersen, Oluf ;
Aben, Katja K. ;
Witjes, J. Alfred ;
Swinkels, Dorine W. ;
den Heijer, Martin ;
Franke, Barbara ;
Verbeek, Andre L. M. ;
Becker, Diane M. ;
Yanek, Lisa R. ;
Becker, Lewis C. ;
Tryggvadottir, Laufey ;
Rafnar, Thorunn ;
Gulcher, Jeffrey ;
Kiemeney, Lambertus A. ;
Kong, Augustine ;
Thorsteinsdottir, Unnur ;
Stefansson, Kari .
NATURE GENETICS, 2008, 40 (05) :609-615