Selection of important variables by statistical learning in genome-wide association analysis

被引:0
|
作者
Wei (Will) Yang
C Charles Gu
机构
[1] Washington University School of Medicine,Division of Biostatistics
[2] Washington University School of Medicine,Department of Genetics
关键词
Bayesian Network; Random Forest; Coronary Artery Calcification; Risk SNPs; Random Forest Analysis;
D O I
10.1186/1753-6561-3-S7-S70
中图分类号
学科分类号
摘要
Genetic analysis of complex diseases demands novel analytical methods to interpret data collected on thousands of variables by genome-wide association studies. The complexity of such analysis is multiplied when one has to consider interaction effects, be they among the genetic variations (G × G) or with environment risk factors (G × E). Several statistical learning methods seem quite promising in this context. Herein we consider applications of two such methods, random forest and Bayesian networks, to the simulated dataset for Genetic Analysis Workshop 16 Problem 3. Our evaluation study showed that an iterative search based on the random forest approach has the potential in selecting important variables, while Bayesian networks can capture some of the underlying causal relationships.
引用
收藏
相关论文
共 50 条
  • [41] Collaborative genome-wide association analysis with cryptography
    Cho, Hyunghoon
    Berger, Bonnie
    NATURE GENETICS, 2025, : 780 - 781
  • [42] Genome-wide association analysis of psoriatic arthritis
    Nair, R.
    Stuart, P.
    Tsoi, L.
    Ellinghaus, E.
    Walsh, J.
    Chandran, V.
    Tejasvi, T.
    Esko, T.
    Duffin, K.
    Ike, R.
    Bowcock, A.
    Voorhees, J.
    Lim, H.
    Weichenthal, M.
    Franke, A.
    Rahman, P.
    Krueger, G.
    Abecasis, G.
    Gladman, D.
    Elder, J.
    BRITISH JOURNAL OF DERMATOLOGY, 2014, 171 (06) : E111 - E111
  • [43] Machine learning approaches to genome-wide association studies
    Enoma, David O.
    Bishung, Janet
    Abiodun, Theresa
    Ogunlana, Olubanke
    Osamor, Victor Chukwudi
    JOURNAL OF KING SAUD UNIVERSITY SCIENCE, 2022, 34 (04)
  • [44] Power analysis for genome-wide association studies
    Klein, Robert J.
    BMC GENETICS, 2007, 8 (1)
  • [45] Editorial: Machine Learning in Genome-Wide Association Studies
    Hu, Ting
    Darabos, Christian
    Urbanowicz, Ryan
    FRONTIERS IN GENETICS, 2020, 11
  • [46] Genome-wide linkage and genome-wide association -: Can they be reconciled?
    Mueller-Myhsok, Bertram
    ANNALS OF HUMAN GENETICS, 2008, 72 : 687 - 687
  • [47] USE OF GENOME-WIDE ASSOCIATION STUDIES IN SELECTION OF CANDIDATE SNPS
    Bolton, J.
    Price, J.
    ATHEROSCLEROSIS SUPPLEMENTS, 2009, 10 (02)
  • [48] Genome-wide selection and association in animal breeding using ssGBLUP
    Nedel Pertile, Simone Fernanda
    Fonseca e Silva, Fabyano
    Salvian, Mayara
    Mourao, Gerson Barreto
    PESQUISA AGROPECUARIA BRASILEIRA, 2016, 51 (10) : 1729 - 1736
  • [49] SNP Selection Strategies from Genome-Wide Association Studies
    Sinnwell, J. P.
    Schaid, D. J.
    GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 714 - 714
  • [50] Population Substructure and Control Selection in Genome-Wide Association Studies
    Yu, Kai
    Wang, Zhaoming
    Li, Qizhai
    Wacholder, Sholom
    Hunter, David J.
    Hoover, Robert N.
    Chanock, Stephen
    Thomas, Gilles
    PLOS ONE, 2008, 3 (07):