Reconstructing biological networks using conditional correlation analysis

被引:84
作者
Rice, JJ [1 ]
Tu, Y [1 ]
Stolovitzky, G [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Computat Biol Ctr, Yorktown Hts, NY 10598 USA
关键词
D O I
10.1093/bioinformatics/bti064
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One of the present challenges in biological research is the organization of the data originating from high-throughput technologies. One way in which this information can be organized is in the form of networks of influences, physical or statistical, between cellular components. We propose an experimental method for probing biological networks, analyzing the resulting data and reconstructing the network architecture. Methods: We use networks of known topology consisting of nodes (genes), directed edges (gene-gene interactions) and a dynamics for the genes' mRNA concentrations in terms of the gene-gene interactions. We proposed a network reconstruction algorithm based on the conditional correlation of the mRNA equilibrium concentration between two genes given that one of them was knocked down. Using simulated gene expression data on networks of known connectivity, we investigated how the reconstruction error is affected by noise, network topology, size, sparseness and dynamic parameters. Results: Errors arise from correlation between nodes connected through intermediate nodes (false positives) and when the correlation between two directly connected nodes is obscured by noise, non-linearity or multiple inputs to the target node (false negatives). Two critical components of the method are as follows: (1) the choice of an optimal correlation threshold for predicting connections and (2) the reduction of errors arising from indirect connections (for which a novel algorithm is proposed). With these improvements, we can reconstruct networks with the topology of the transcriptional regulatory network in Escherichia coli with a reasonably low error rate.
引用
收藏
页码:765 / 773
页数:9
相关论文
共 38 条
[21]  
Liang S, 1998, Pac Symp Biocomput, P18
[22]   Small interfering RNA production by enzymatic engineering of DNA (SPEED) [J].
Luo, B ;
Heard, AD ;
Lodish, HF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (15) :5494-5499
[23]   It's a noisy business! Genetic regulation at the nanomolar scale [J].
McAdams, HH ;
Arkin, A .
TRENDS IN GENETICS, 1999, 15 (02) :65-69
[24]   Network motifs: Simple building blocks of complex networks [J].
Milo, R ;
Shen-Orr, S ;
Itzkovitz, S ;
Kashtan, N ;
Chklovskii, D ;
Alon, U .
SCIENCE, 2002, 298 (5594) :824-827
[25]   A resource for large-scale RNA-interference-based screens in mammals [J].
Paddison, PJ ;
Silva, JM ;
Conklin, DS ;
Schlabach, M ;
Li, MM ;
Aruleba, S ;
Balija, V ;
O'Shaughnessy, A ;
Gnoj, L ;
Scobie, K ;
Chang, K ;
Westbrook, T ;
Cleary, M ;
Sachidanandam, R ;
McCombie, WR ;
Elledge, SJ ;
Hannon, GJ .
NATURE, 2004, 428 (6981) :427-431
[26]  
Rice J. Jeremy, 2004, Biosilico, V2, P70
[27]   Assigning numbers to the arrows: Parameterizing a gene regulation network by using accurate expression kinetics [J].
Ronen, M ;
Rosenberg, R ;
Shraiman, BI ;
Alon, U .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (16) :10555-10560
[28]   Negative autoregulation speeds the response times of transcription networks [J].
Rosenfeld, N ;
Elowitz, MB ;
Alon, U .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 323 (05) :785-793
[29]   On the deduction of chemical reaction pathways from measurements of time series of concentrations [J].
Samoilov, M ;
Arkin, A ;
Ross, J .
CHAOS, 2001, 11 (01) :108-114
[30]   Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data [J].
Segal, E ;
Shapira, M ;
Regev, A ;
Pe'er, D ;
Botstein, D ;
Koller, D ;
Friedman, N .
NATURE GENETICS, 2003, 34 (02) :166-176