Information theoretic approaches for inference of biological networks from continuous-valued data

被引:13
作者
Budden, David M. [1 ,2 ]
Crampin, Edmund J. [2 ,3 ,4 ,5 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[2] Univ Melbourne, Melbourne Sch Engn, Syst Biol Lab, Parkville, Vic 3010, Australia
[3] ARC Ctr Excellence Convergent Bionano Sci & Techn, Parkville, Vic 3010, Australia
[4] Univ Melbourne, Dept Math & Stat, Parkville, Vic 3010, Australia
[5] Univ Melbourne, Sch Med, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
Gene regulatory network; Transcriptional regulation; Gene expression; REGULATORY NETWORKS; GENE; ENTROPY; DYNAMICS; PACKAGE;
D O I
10.1186/s12918-016-0331-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed, which leverage the wealth of transcriptomic data generated by recent consortia to study indirect, gene-level relationships between transcriptional regulators. Despite the popularity of such methods, previous methods of GRN inference exhibit limitations that we highlight and address through the lens of information theory. Results: We introduce new model-free and non-linear information theoretic measures for the inference of GRNs and other biological networks from continuous-valued data. Although previous tools have implemented mutual information as a means of inferring pairwise associations, they either introduce statistical bias through discretisation or are limited to modelling undirected relationships. Our approach overcomes both of these limitations, as demonstrated by a substantial improvement in empirical performance for a set of 160 GRNs of varying size and topology. Conclusions: The information theoretic measures described in this study yield substantial improvements over previous approaches (e.g. ARACNE) and have been implemented in the latest release of NAIL (Network Analysis and Inference Library). However, despite the theoretical and empirical advantages of these new measures, they do not circumvent the fundamental limitation of indeterminacy exhibited across this class of biological networks. These methods have presently found value in computational neurobiology, and will likely gain traction for GRN analysis as the volume and quality of temporal transcriptomics data continues to improve.
引用
收藏
页数:7
相关论文
共 59 条
[1]  
[Anonymous], 1959, Publicationes Mathematicae
[2]   How to infer gene networks from expression profiles [J].
Bansal, Mukesh ;
Belcastro, Vincenzo ;
Ambesi-Impiombato, Alberto ;
di Bernardo, Diego .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[3]   Continuous representations of time-series gene expression data [J].
Bar-Joseph, Z ;
Gerber, GK ;
Gifford, DK ;
Jaakkola, TS ;
Simon, I .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (3-4) :341-356
[4]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[5]   Network medicine: a network-based approach to human disease [J].
Barabasi, Albert-Laszlo ;
Gulbahce, Natali ;
Loscalzo, Joseph .
NATURE REVIEWS GENETICS, 2011, 12 (01) :56-68
[6]   Transfer Entropy as a Log-Likelihood Ratio [J].
Barnett, Lionel ;
Bossomaier, Terry .
PHYSICAL REVIEW LETTERS, 2012, 109 (13)
[7]   Granger Causality and Transfer Entropy Are Equivalent for Gaussian Variables [J].
Barnett, Lionel ;
Barrett, Adam B. ;
Seth, Anil K. .
PHYSICAL REVIEW LETTERS, 2009, 103 (23)
[8]  
Beaudry NJ, 2012, QUANTUM INF COMPUT, V12, P432
[9]   Information processing in echo state networks at the edge of chaos [J].
Boedecker, Joschka ;
Obst, Oliver ;
Lizier, Joseph T. ;
Mayer, N. Michael ;
Asada, Minoru .
THEORY IN BIOSCIENCES, 2012, 131 (03) :205-213
[10]   Predictive modelling of gene expression from transcriptional regulatory elements [J].
Budden, David M. ;
Hurley, Daniel G. ;
Crampin, Edmund J. .
BRIEFINGS IN BIOINFORMATICS, 2015, 16 (04) :616-628