Detection of SNP-SNP Interactions in Genome-wide Association Data Using Random Forests and Association Rules

Cited by: 0
Authors
Tung Nguyen [1]
Ly Le [2]
Affiliations
[1] Thuyloi Univ, Fac Comp Sci & Engn, 175 Tay Son, Hanoi, Vietnam
[2] Vietnam Natl Univ, Int Univ, Sch Biotechnol, Ho Chi Minh City 700000, Vietnam
Source
2018 12TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA) | 2018
Keywords
Genome-wide association studies; Data Mining; Random Forests; SNP-SNP interactions; IDENTIFICATION; PATTERNS; LOCI;
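DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract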
The primary goal of genome-wide association studies (GWAS) is to discover genes or variants associated with complex diseases. Most GWAS use single-SNP (single nucleotide polymorphism) approaches that focus on assessing the association between each individual SNP and disease; they therefore cannot take combinations of SNPs into account. However, complex diseases are thought to involve complex etiologies, including complicated interactions between many SNPs. Different approaches are thus needed to identify SNPs that influence disease risk jointly or through complex interactions. To discover SNP-SNP interactions, this paper proposes first applying an improved Random Forest algorithm tailored to structured GWAS data and then extracting all rules from the resulting trees to analyse SNP interactions. The method selects subgroups of informative SNPs that are most relevant to the disease in order to build accurate decision trees, and then derives SNP interactions from these trees. In this way, it reduces dimensionality and can perform well on high-dimensional SNP data sets. We conducted experiments on two genome-wide SNP data sets to demonstrate the effectiveness of the method for detecting SNP-SNP interactions.
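The rule-extraction step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the trees are hand-built toy structures rather than the output of their improved Random Forest, the SNP names (`rs1`, `rs2`, `rs3`), the genotype thresholds, and the helper functions `node`, `leaf`, `extract_rules`, and `count_snp_pairs` are all hypothetical, and counting co-occurring SNP pairs on "case" paths is only one simple stand-in for an association-rule analysis.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy trees: each internal node tests one SNP genotype
# (coded 0/1/2 minor alleles); each leaf carries a predicted class.
def node(snp, threshold, left, right):
    return {"snp": snp, "threshold": threshold, "left": left, "right": right}

def leaf(label):
    return {"label": label}

def extract_rules(tree, conditions=()):
    """Yield (conditions, label) for every root-to-leaf path of a tree."""
    if "label" in tree:
        yield list(conditions), tree["label"]
        return
    snp, thr = tree["snp"], tree["threshold"]
    yield from extract_rules(tree["left"], conditions + ((snp, "<=", thr),))
    yield from extract_rules(tree["right"], conditions + ((snp, ">", thr),))

def count_snp_pairs(forest, target_label="case"):
    """Count how often two SNPs appear together in a rule predicting target_label."""
    counts = Counter()
    for tree in forest:
        for conds, label in extract_rules(tree):
            if label != target_label:
                continue
            snps = sorted({snp for snp, _, _ in conds})
            counts.update(combinations(snps, 2))
    return counts

# Three invented trees standing in for a trained forest.
forest = [
    node("rs1", 0, leaf("control"), node("rs2", 0, leaf("control"), leaf("case"))),
    node("rs2", 0, leaf("control"), node("rs1", 1, leaf("control"), leaf("case"))),
    node("rs3", 1, leaf("control"), node("rs1", 0, leaf("control"), leaf("case"))),
]

pair_counts = count_snp_pairs(forest)
for pair, n in pair_counts.most_common():
    print(pair, n)
```

SNP pairs that recur across many "case" rules are candidates for interaction; in practice such counts would need to be assessed against support and confidence thresholds or a permutation baseline before being interpreted.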
Pages: 32 / +
Number of pages: 7