TIGRESS: Trustful Inference of Gene REgulation using Stability Selection

被引:289
作者
Haury, Anne-Claire [1 ,2 ,3 ]
Mordelet, Fantine [4 ]
Vera-Licona, Paola [1 ,2 ,3 ]
Vert, Jean-Philippe [1 ,2 ,3 ]
机构
[1] Mines ParisTech, Ctr Computat Biol, F-77300 Fontainebleau, France
[2] Inst Curie, F-75248 Paris, France
[3] INSERM, U900, F-75248 Paris, France
[4] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
基金
欧洲研究理事会;
关键词
Gene Regulatory Network inference; Feature selection; Gene expression data; LARS; Stability selection; INFERRING CELLULAR NETWORKS; TRANSCRIPTIONAL REGULATION; COMPOUND-MODE; EXPRESSION; ARACNE;
D O I
10.1186/1752-0509-6-145
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Inferring the structure of gene regulatory networks (GRN) from a collection of gene expression data has many potential applications, from the elucidation of complex biological processes to the identification of potential drug targets. It is however a notoriously difficult problem, for which the many existing methods reach limited accuracy. Results: In this paper, we formulate GRN inference as a sparse regression problem and investigate the performance of a popular feature selection method, least angle regression (LARS) combined with stability selection, for that purpose. We introduce a novel, robust and accurate scoring technique for stability selection, which improves the performance of feature selection with LARS. The resulting method, which we call TIGRESS (for Trustful Inference of Gene REgulation with Stability Selection), was ranked among the top GRN inference methods in the DREAM5 gene network inference challenge. In particular, TIGRESS was evaluated to be the best linear regression-based method in the challenge. We investigate in depth the influence of the various parameters of the method, and show that a fine parameter tuning can lead to significant improvements and state-of-the-art performance for GRN inference, in both directed and undirected settings. Conclusions: TIGRESS reaches state-of-the-art performance on benchmark data, including both in silico and in vivo (E. coli and S. cerevisiae) networks. This study confirms the potential of feature selection techniques for GRN inference. Code and data are available on http://cbio.ensmp.fr/tigress. Moreover, TIGRESS can be run online through the GenePattern platform (GP-DREAM, http://dream.broadinstitute.org).
引用
收藏
页数:17
相关论文
共 38 条
[1]   Algorithms for identifying Boolean networks and related biological networks based on matrix multiplication and fingerprint function [J].
Akutsu, T ;
Miyano, S ;
Kuhara, S .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) :331-343
[2]   A test case of correlation metric construction of a reaction pathway from measurements [J].
Arkin, A ;
Shen, PD ;
Ross, J .
SCIENCE, 1997, 277 (5330) :1275-1279
[3]  
Bach F. R., 2008, P 25 INT C MACH LEAR, P33, DOI DOI 10.1145/1390156.1390161
[4]   Inference of gene regulatory networks and compound mode of action from time course gene expression profiles [J].
Bansal, M ;
Della Gatta, G ;
di Bernardo, D .
BIOINFORMATICS, 2006, 22 (07) :815-822
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Discovering functional relationships between RNA expression and chemotherapeutic susceptibility using relevance networks [J].
Butte, AJ ;
Tamayo, P ;
Slonim, D ;
Golub, TR ;
Kohane, IS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) :12182-12186
[7]   A stochastic differential equation model for quantifying transcriptional regulatory network in Saccharomyces cerevisiae [J].
Chen, KC ;
Wang, TY ;
Tseng, HH ;
Huang, CYF ;
Kao, CY .
BIOINFORMATICS, 2005, 21 (12) :2883-2890
[8]  
Chen T, 1999, Pac Symp Biocomput, P29
[9]   Chemogenomic profiling on a genomewide scale using reverse-engineered gene networks [J].
di Bernardo, D ;
Thompson, MJ ;
Gardner, TS ;
Chobot, SE ;
Eastwood, EL ;
Wojtovich, AP ;
Elliott, SJ ;
Schaus, SE ;
Collins, JJ .
NATURE BIOTECHNOLOGY, 2005, 23 (03) :377-383
[10]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499