Estimating large-scale signaling networks through nested effect models with intervention effects from microarray data

被引:31
作者
Froehlich, Holger [1 ]
Fellmann, Mark [1 ]
Sueltmann, Holger [1 ]
Poustka, Annemarie [1 ]
Beissbarth, Tim [1 ]
机构
[1] German Canc Res Ctr, D-69120 Heidelberg, Germany
关键词
D O I
10.1093/bioinformatics/btm634
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Targeted interventions using RNA interference in combination with the measurement of secondary effects with DNA microarrays can be used to computationally reverse engineer features of upstream non-transcriptional signaling cascades based on the nested structure of effects. Results: We extend previous work by Markowetz et al., who proposed a statistical framework to score different network hypotheses. Our extensions go in several directions: we show how prior assumptions on the network structure can be incorporated into the scoring scheme by defining appropriate prior distributions on the network structure as well as on hyperparameters. An approach called module networks is introduced to scale up the original approach, which is limited to around 5 genes, to infer large-scale networks of more than 30 genes. Instead of the data discretization step needed in the original framework, we propose the usage of a beta-uniform mixture distribution on the P-value profile, resulting from differential gene expression calculation, to quantify effects. Extensive simulations on artificial data and application of our module network approach to infer the signaling network between 13 genes in the ER-alpha pathway in human MCF-7 breast cancer cells show that our approach gives sensible results. Using a bootstrapping and a jackknife approach, this reconstruction is found to be statistically stable.
引用
收藏
页码:2650 / 2656
页数:7
相关论文
共 11 条
[1]  
Aho A. V., 1972, SIAM Journal on Computing, V1, P131, DOI 10.1137/0201008
[2]   Sequential activation of signaling pathways during innate immune responses in Drosophila [J].
Boutros, M ;
Agaisse, H ;
Perrimon, N .
DEVELOPMENTAL CELL, 2002, 3 (05) :711-722
[3]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[4]  
FROHLICH H, 2007, P GERMAN C BIOINFORM, P45
[5]   GOSim -: an R-package for computation of information theoretic GO similarities between terms and gene products [J].
Frohlich, Holger ;
Speer, Nora ;
Poustka, Annemarie ;
Beissarth, Tim .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]  
Huber Wolfgang, 2002, Bioinformatics, V18 Suppl 1, pS96
[7]   Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks [J].
Imoto, S ;
Higuchi, T ;
Goto, T ;
Tashiro, K ;
Kuhara, S ;
Miyano, S .
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, :104-113
[8]   Non-transcriptional pathway features reconstructed from secondary effects of RNA interference [J].
Markowetz, F ;
Bloch, J ;
Spang, R .
BIOINFORMATICS, 2005, 21 (21) :4026-4032
[9]   Nested effects models for high-dimensional phenotyping screens [J].
Markowetz, Florian ;
Kostka, Dennis ;
Troyanskaya, Olga G. ;
Spang, Rainer .
BIOINFORMATICS, 2007, 23 (13) :I305-I312
[10]   Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values [J].
Pounds, S ;
Morris, SW .
BIOINFORMATICS, 2003, 19 (10) :1236-1242