Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape

被引:92
作者
Griffon, Aurelien [1 ,2 ]
Barbier, Quentin [1 ,2 ]
Dalino, Jordi [1 ,2 ]
van Helden, Jacques [1 ,2 ]
Spicuglia, Salvatore [1 ,2 ]
Ballester, Benoit [1 ,2 ]
机构
[1] INSERM, TAGC UMR1090, F-13288 Marseille, France
[2] Aix Marseille Univ, TAGC UMR1090, F-13288 Marseille, France
关键词
TRANSCRIPTION FACTORS; HISTONE MODIFICATIONS; FACTOR COLOCALIZATION; SUPER-ENHANCERS; CELL IDENTITY; DNA-BINDING; EXPRESSION; DATABASE; ANNOTATION; GENOME;
D O I
10.1093/nar/gku1280
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The large collections of ChIP-seq data rapidly accumulating in public data warehouses provide genome-wide binding site maps for hundreds of transcription factors (TFs). However, the extent of the regulatory occupancy space in the human genome has not yet been fully apprehended by integrating public ChIP-seq data sets and combining it with ENCODE TFs map. To enable genome-wide identification of regulatory elements we have collected, analysed and retained 395 available ChIP-seq data sets merged with ENCODE peaks covering a total of 237 TFs. This enhanced repertoire complements and refines current genome-wide occupancy maps by increasing the human genome regulatory search space by 14% compared to ENCODE alone, and also increases the complexity of the regulatory dictionary. As a direct application we used this unified binding repertoire to annotate variant enhancer loci (VELs) from H3K4me1 mark in two cancer cell lines (MCF-7, CRC) and observed enrichments of specific TFs involved in biological key functions to cancer development and proliferation. Those enrichments of TFs within VELs provide a direct annotation of non-coding regions detected in cancer genomes. Finally, full access to this catalogue is available online together with the TFs enrichment analysis tool (http://tagc.univ-mrs.fr/remap/).
引用
收藏
页数:14
相关论文
共 60 条
[1]   Epigenomic Enhancer Profiling Defines a Signature of Colon Cancer [J].
Akhtar-Zaidi, Batool ;
Lari, Richard Cowper-Sal ;
Corradin, Olivia ;
Saiakhova, Alina ;
Bartels, Cynthia F. ;
Balasubramanian, Dheepa ;
Myeroff, Lois ;
Lutterbaugh, James ;
Jarrar, Awad ;
Kalady, Matthew F. ;
Willis, Joseph ;
Moore, Jason H. ;
Tesar, Paul J. ;
Laframboise, Thomas ;
Markowitz, Sanford ;
Lupien, Mathieu ;
Scacheri, Peter C. .
SCIENCE, 2012, 336 (6082) :736-739
[2]   An atlas of active enhancers across human cell types and tissues [J].
Andersson, Robin ;
Gebhard, Claudia ;
Miguel-Escalada, Irene ;
Hoof, Ilka ;
Bornholdt, Jette ;
Boyd, Mette ;
Chen, Yun ;
Zhao, Xiaobei ;
Schmidl, Christian ;
Suzuki, Takahiro ;
Ntini, Evgenia ;
Arner, Erik ;
Valen, Eivind ;
Li, Kang ;
Schwarzfischer, Lucia ;
Glatz, Dagmar ;
Raithel, Johanna ;
Lilje, Berit ;
Rapin, Nicolas ;
Bagger, Frederik Otzen ;
Jorgensen, Mette ;
Andersen, Peter Refsing ;
Bertin, Nicolas ;
Rackham, Owen ;
Burroughs, A. Maxwell ;
Baillie, J. Kenneth ;
Ishizu, Yuri ;
Shimizu, Yuri ;
Furuhata, Erina ;
Maeda, Shiori ;
Negishi, Yutaka ;
Mungall, Christopher J. ;
Meehan, Terrence F. ;
Lassmann, Timo ;
Itoh, Masayoshi ;
Kawaji, Hideya ;
Kondo, Naoto ;
Kawai, Jun ;
Lennartsson, Andreas ;
Daub, Carsten O. ;
Heutink, Peter ;
Hume, David A. ;
Jensen, Torben Heick ;
Suzuki, Harukazu ;
Hayashizaki, Yoshihide ;
Mueller, Ferenc ;
Forrest, Alistair R. R. ;
Carninci, Piero ;
Rehli, Michael ;
Sandelin, Albin .
NATURE, 2014, 507 (7493) :455-+
[3]   NCBI GEO: mining tens of millions of expression profiles - database and tools update [J].
Barrett, Tanya ;
Troup, Dennis B. ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Rudnev, Dmitry ;
Evangelista, Carlos ;
Kim, Irene F. ;
Soboleva, Alexandra ;
Tomashevsky, Maxim ;
Edgar, Ron .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D760-D765
[4]  
Bastian M., ICWSM, Vvol 3, ppp 361, DOI [10.1016/B978-0-12-372180-8.50042-1, DOI 10.1609/ICWSM.V3I1.13937]
[5]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[6]   Discovery and Characterization of Super-Enhancer-Associated Dependencies in Diffuse Large B Cell Lymphoma [J].
Chapuy, Bjoern ;
McKeown, Michael R. ;
Lin, Charles Y. ;
Monti, Stefano ;
Roemer, Margaretha G. M. ;
Qi, Jun ;
Rahl, Peter B. ;
Sun, Heather H. ;
Yeda, Kelly T. ;
Doench, John G. ;
Reichert, Elaine ;
Kung, Andrew L. ;
Rodig, Scott J. ;
Young, Richard A. ;
Shipp, Margaret A. ;
Bradner, James E. .
CANCER CELL, 2013, 24 (06) :777-790
[7]   Computational Identification of Diverse Mechanisms Underlying Transcription Factor-DNA Occupancy [J].
Cheng, Qiong ;
Kazemian, Majid ;
Hannah Pham ;
Blatti, Charles ;
Celniker, Susan E. ;
Wolfe, Scot A. ;
Brodsky, Michael H. ;
Sinha, Saurabh .
PLOS GENETICS, 2013, 9 (08)
[8]   An effective statistical evaluation of ChIPseq dataset similarity [J].
Chikina, Maria D. ;
Troyanskaya, Olga G. .
BIOINFORMATICS, 2012, 28 (05) :607-613
[9]   Functional elements demarcated by histone modifications in breast cancer cells [J].
Choe, Moon Kyung ;
Hong, Chang-Pyo ;
Park, Jihwan ;
Seo, Seong Hye ;
Roh, Tae-Young .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2012, 418 (03) :475-482
[10]   Distribution and intensity of constraint in mammalian genomic sequence [J].
Cooper, GM ;
Stone, EA ;
Asimenos, G ;
Green, ED ;
Batzoglou, S ;
Sidow, A .
GENOME RESEARCH, 2005, 15 (07) :901-913