Elastic-Net Copula Granger Causality for Inference of Biological Networks

被引:11
作者
Furqan, Mohammad Shaheryar [1 ,2 ]
Siyal, Mohammad Yakoob [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[2] Nanyang Technol Univ, Infocomm Ctr Excellence, INFINITUS, Singapore, Singapore
来源
PLOS ONE | 2016年 / 11卷 / 10期
关键词
PARTIAL DIRECTED COHERENCE; GENE; EXPRESSION; MODELS; REGULARIZATION; IDENTIFICATION; CONNECTIVITY; REGRESSION; TOOLBOX; LASSO;
D O I
10.1371/journal.pone.0165612
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Aim In bioinformatics, the inference of biological networks is one of the most active research areas. It involves decoding various complex biological networks that are responsible for performing diverse functions in human body. Among these networks analysis, most of the research focus is towards understanding effective brain connectivity and gene networks in order to cure and prevent related diseases like Alzheimer and cancer respectively. However, with recent advances in data procurement technology, such as DNA microarray analysis and fMRI that can simultaneously process a large amount of data, it yields high-dimensional data sets. These high dimensional dataset analyses possess challenges for the analyst. Background Traditional methods of Granger causality inference use ordinary least-squares methods for structure estimation, which confront dimensionality issues when applied to high-dimensional data. Apart from dimensionality issues, most existing methods were designed to capture only the linear inferences from time series data. Method and Conclusion In this paper, we address the issues involved in assessing Granger causality for both linear and nonlinear high-dimensional data by proposing an elegant form of the existing LASSO-based method that we call "Elastic-Net Copula Granger causality". This method provides a more stable way to infer biological networks which has been verified using rigorous experimentation. We have compared the proposed method with the existing method and demonstrated that this new strategy outperforms the existing method on all measures: precision, false detection rate, recall, and F1 score. We have also applied both methods to real HeLa cell data and StarPlus fMRI datasets and presented a comparison of the effectiveness of both methods.
引用
收藏
页数:16
相关论文
共 59 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] Amblard PO, 2012, MACH LEARN SIGN PROC
  • [3] [Anonymous], 2009, PROGR NATURAL SCI
  • [4] [Anonymous], 2005, NEW INTRO MULTIPLE T
  • [5] [Anonymous], 1993, Nonparametric regression and generalized linear models: A roughness penalty approach
  • [6] Partial directed coherence:: a new concept in neural structure determination
    Baccalá, LA
    Sameshima, K
    [J]. BIOLOGICAL CYBERNETICS, 2001, 84 (06) : 463 - 474
  • [7] Bahadori MT, 2013, 2013 SIAM INT C DAT
  • [8] The MVGC multivariate Granger causality toolbox: A new approach to Granger-causal inference
    Barnett, Lionel
    Seth, Anil K.
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2014, 223 : 50 - 68
  • [9] Chatr-aryamontri A., 2014, NUCL ACIDS RES
  • [10] FBLG: A Simple and Effective Approach for Temporal Dependence Discovery from Time Series Data
    Cheng, Dehua
    Bahadori, Mohammad Taha
    Liu, Yan
    [J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 382 - 391