Encore: Genetic Association Interaction Network Centrality Pipeline and Application to SLE Exome Data

被引:20
作者
Davis, Nicholas A. [1 ]
Lareau, Caleb A. [2 ,3 ]
White, Bill C. [1 ]
Pandey, Ahwan [1 ]
Wiley, Graham [3 ]
Montgomery, Courtney G. [3 ]
Gaffney, Patrick M. [3 ]
McKinney, B. A. [1 ,2 ]
机构
[1] Univ Tulsa, Tandy Sch Comp Sci, Tulsa, OK 74104 USA
[2] Univ Tulsa, Dept Math, Tulsa, OK 74104 USA
[3] Oklahoma Med Res Fdn, Arthrit & Clin Immunol Res Program, Oklahoma City, OK 73104 USA
关键词
epistasis network; machine learning; network analysis; network centrality; Systemic Lupus Erythematous; GENOME-WIDE ASSOCIATION; ANTIBODY-RESPONSE; POLYSIALIC ACID; POPULATION; EXPRESSION; DISEASE; MODELS; TOOL;
D O I
10.1002/gepi.21739
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Open source tools are needed to facilitate the construction, analysis, and visualization of gene-gene interaction networks for sequencing data. To address this need, we present Encore, an open source network analysis pipeline for genome-wide association studies and rare variant data. Encore constructs Genetic Association Interaction Networks or epistasis networks using two optional approaches: our previous information-theory method or a generalized linear model approach. Additionally, Encore includes multiple data filtering options, including Random Forest/Random Jungle for main effect enrichment and Evaporative Cooling and Relief-F filters for enrichment of interaction effects. Encore implements SNPrank network centrality for identifying susceptibility hubs (nodes containing a large amount of disease susceptibility information through the combination of multivariate main effects and multiple gene-gene interactions in the network), and it provides appropriate files for interactive visualization of a network using tools from our online Galaxy instance. We implemented these algorithms in C++ using OpenMP for shared-memory parallel analysis on a server or desktop. To demonstrate Encore's utility in analysis of genetic sequencing data, we present an analysis of exome resequencing data from healthy individuals and those with Systemic Lupus Erythematous (SLE). Our results verify the importance of the previously associated SLE genes HLA-DRB and NCF2, and these two genes had the highest gene-gene interaction degrees among the susceptibility hubs. An additional 14 genes previously associated with SLE emerged in our epistasis network model of the exome data, and three novel candidate genes, ST8SIA4, CMTM4, and C2CD4B, were implicated in the model. In summary, we present a comprehensive tool for epistasis network analysis and the first such analysis of exome data from a genetic study of SLE.
引用
收藏
页码:614 / 621
页数:8
相关论文
共 32 条
  • [31] Signaling by IL-12 and IL-23 and the immunoregulatory roles of STAT4
    Watford, WT
    Hissong, BD
    Bream, JH
    Kanno, Y
    Muul, L
    O'Shea, JJ
    [J]. IMMUNOLOGICAL REVIEWS, 2004, 202 : 139 - 156
  • [32] Association of TBX21 gene haplotypes in a Chinese population with systemic lupus erythematosus
    You, Y.
    Zhao, W.
    Chen, S.
    Tan, W.
    Dan, Y.
    Hao, F.
    Deng, G.
    [J]. SCANDINAVIAN JOURNAL OF RHEUMATOLOGY, 2010, 39 (03) : 254 - 258