iRegulon: From a Gene List to a Gene Regulatory Network Using Large Motif and Track Collections

被引:641
作者
Janky, Rekin's [1 ]
Verfaillie, Annelien [1 ]
Imrichova, Hana [1 ]
Van de Sande, Bram [1 ]
Standaert, Laura [2 ,3 ]
Christiaens, Valerie [1 ]
Hulselmans, Gert [1 ]
Herten, Koen [1 ]
Sanchez, Marina Naval [1 ]
Potier, Delphine [1 ]
Svetlichnyy, Dmitry [1 ]
Atak, Zeynep Kalender [1 ]
Fiers, Mark [3 ]
Marine, Jean-Christophe [2 ,3 ]
Aerts, Stein [1 ]
机构
[1] Katholieke Univ Leuven, Ctr Human Genet, Lab Computat Biol, Leuven, Belgium
[2] Katholieke Univ Leuven, Ctr Human Genet, Lab Mol Canc Biol, Leuven, Belgium
[3] VIB Ctr Biol Dis, Lab Mol Canc Biol, Leuven, Belgium
关键词
TRANSCRIPTION FACTOR-BINDING; INTEGRATIVE GENOMICS VIEWER; TARGET GENES; NF-Y; WIDE IDENTIFICATION; DENSE CLUSTERS; SITE MOTIFS; DNA; ELEMENTS; EXPRESSION;
D O I
10.1371/journal.pcbi.1003731
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Identifying master regulators of biological processes and mapping their downstream gene networks are key challenges in systems biology. We developed a computational method, called iRegulon, to reverse-engineer the transcriptional regulatory network underlying a co-expressed gene set using cis-regulatory sequence analysis. iRegulon implements a genome-wide ranking-and-recovery approach to detect enriched transcription factor motifs and their optimal sets of direct targets. We increase the accuracy of network inference by using very large motif collections of up to ten thousand position weight matrices collected from various species, and linking these to candidate human TFs via a motif2TF procedure. We validate iRegulon on gene sets derived from ENCODE ChIP-seq data with increasing levels of noise, and we compare iRegulon with existing motif discovery methods. Next, we use iRegulon on more challenging types of gene lists, including microRNA target sets, protein-protein interaction networks, and genetic perturbation data. In particular, we over-activate p53 in breast cancer cells, followed by RNA-seq and ChIP-seq, and could identify an extensive up-regulated network controlled directly by p53. Similarly we map a repressive network with no indication of direct p53 regulation but rather an indirect effect via E2F and NFY. Finally, we generalize our computational framework to include regulatory tracks such as ChIP-seq data and show how motif and track discovery can be combined to map functional regulatory interactions among co-expressed genes. iRegulon is available as a Cytoscape plugin from http://iregulon.aertslab.org.
引用
收藏
页数:19
相关论文
共 137 条
[1]   Gene prioritization through genomic data fusion [J].
Aerts, S ;
Lambrechts, D ;
Maity, S ;
Van Loo, P ;
Coessens, B ;
De Smet, F ;
Tranchevent, LC ;
De Moor, B ;
Marynen, P ;
Hassan, B ;
Carmeliet, P ;
Moreau, Y .
NATURE BIOTECHNOLOGY, 2006, 24 (05) :537-544
[2]   Fine-Tuning Enhancer Models to Predict Transcriptional Targets across Multiple Genomes [J].
Aerts, Stein ;
van Helden, Jacques ;
Sand, Olivier ;
Hassan, Bassem A. .
PLOS ONE, 2007, 2 (11)
[3]   COMPUTATIONAL STRATEGIES FOR THE GENOME-WIDE IDENTIFICATION OF CIS-REGULATORY ELEMENTS AND TRANSCRIPTIONAL TARGETS [J].
Aerts, Stein .
TRANSCRIPTIONAL SWITCHES DURING DEVELOPMENT, 2012, 98 :121-145
[4]   Robust Target Gene Discovery through Transcriptome Perturbations and Genome-Wide Enhancer Predictions in Drosophila Uncovers a Regulatory Basis for Sensory Specification [J].
Aerts, Stein ;
Quan, Xiao-Jiang ;
Claeys, Annelies ;
Sanchez, Marina Naval ;
Tate, Phillip ;
Yan, Jiekun ;
Hassan, Bassem A. .
PLOS BIOLOGY, 2010, 8 (07)
[5]  
Anders S., 2010, GENOME BIOL, V11, pR106, DOI [10.1186/gb-2010-11-10-r106, DOI 10.1186/gb-2010-11-10-r106]
[6]  
[Anonymous], NUCLEIC ACIDS RES
[7]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[8]   Relating genes to function: identifying enriched transcription factors using the ENCODE ChIP-Seq significance tool [J].
Auerbach, Raymond K. ;
Chen, Bin ;
Butte, Atul J. .
BIOINFORMATICS, 2013, 29 (15) :1922-1924
[9]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[10]   Rewiring of Genetic Networks in Response to DNA Damage [J].
Bandyopadhyay, Sourav ;
Mehta, Monika ;
Kuo, Dwight ;
Sung, Min-Kyung ;
Chuang, Ryan ;
Jaehnig, Eric J. ;
Bodenmiller, Bernd ;
Licon, Katherine ;
Copeland, Wilbert ;
Shales, Michael ;
Fiedler, Dorothea ;
Dutkowski, Janusz ;
Guenole, Aude ;
van Attikum, Haico ;
Shokat, Kevan M. ;
Kolodner, Richard D. ;
Huh, Won-Ki ;
Aebersold, Ruedi ;
Keogh, Michael-Christopher ;
Krogan, Nevan J. ;
Ideker, Trey .
SCIENCE, 2010, 330 (6009) :1385-1389