Inferring direct DNA binding from ChIP-seq

被引:337
作者
Bailey, Timothy L. [1 ]
Machanick, Philip [2 ]
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
[2] Rhodes Univ, Dept Comp Sci, ZA-6140 Grahamstown, South Africa
基金
美国国家卫生研究院;
关键词
TRANSCRIPTION-FACTOR; MOTIF DISCOVERY; COREGULATED GENES; SITES; GENOME; IDENTIFICATION; REPRESENTATION; SEQUENCES; MODEL; SETS;
D O I
10.1093/nar/gks433
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide binding data from transcription factor ChIP-seq experiments is the best source of information for inferring the relative DNA-binding affinity of these proteins in vivo. However, standard motif enrichment analysis and motif discovery approaches sometimes fail to correctly identify the binding motif for the ChIP-ed factor. To overcome this problem, we propose 'central motif enrichment analysis' (CMEA), which is based on the observation that the positional distribution of binding sites matching the direct-binding motif tends to be unimodal, well centered and maximal in the precise center of the ChIP-seq peak regions. We describe a novel visualization and statistical analysis tool-CentriMo-that identifies the region of maximum central enrichment in a set of ChIP-seq peak regions and displays the positional distributions of predicted sites. Using CentriMo for motif enrichment analysis, we provide evidence that one transcription factor (Nanog) has different binding affinity in vivo than in vitro, that another binds DNA cooperatively (E2f1), and confirm the in vivo affinity of NFIC, rescuing a difficult ChIP-seq data set. In another data set, CentriMo strongly suggests that there is no evidence of direct DNA binding by the ChIP-ed factor (Smad1). CentriMo is now part of the MEME Suite software package available at ext-link-type="uri" xlink:href="http://meme.nbcr.net" xmlns:xlink="http://www.w3.org/1999/xlink">http://meme.nbcr.net. All data and output files presented here are available at: ext-link-type="uri" xlink:href="http://research.imb.uq.edu.au/t.bailey/sd/Bailey2011a" xmlns:xlink="http://www.w3.org/1999/xlink">http://research.imb.uq.edu.au/t.bailey/sd/Bailey2011a.
引用
收藏
页数:10
相关论文
共 32 条
[1]  
Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
[2]   DREME: motif discovery in transcription factor ChIP-seq data [J].
Bailey, Timothy L. .
BIOINFORMATICS, 2011, 27 (12) :1653-1659
[3]   Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors [J].
Berger, Michael F. ;
Bulyk, Martha L. .
NATURE PROTOCOLS, 2009, 4 (03) :393-411
[4]   Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome [J].
Bieda, M ;
Xu, XQ ;
Singer, MA ;
Green, R ;
Farnham, PJ .
GENOME RESEARCH, 2006, 16 (05) :595-605
[5]   Integration of external signaling pathways with the core transcriptional network in embryonic stem cells [J].
Chen, Xi ;
Xu, Han ;
Yuan, Ping ;
Fang, Fang ;
Huss, Mikael ;
Vega, Vinsensius B. ;
Wong, Eleanor ;
Orlov, Yuriy L. ;
Zhang, Weiwei ;
Jiang, Jianming ;
Loh, Yuin-Han ;
Yeo, Hock Chuan ;
Yeo, Zhen Xuan ;
Narang, Vipin ;
Govindarajan, Kunde Ramamoorthy ;
Leong, Bernard ;
Shahab, Atif ;
Ruan, Yijun ;
Bourque, Guillaume ;
Sung, Wing-Kin ;
Clarke, Neil D. ;
Wei, Chia-Lin ;
Ng, Huck-Hui .
CELL, 2008, 133 (06) :1106-1117
[6]   Trawler:: de novo regulatory motif discovery pipeline for chromatin immunoprecipitation [J].
Ettwiller, Laurence ;
Paten, Benedict ;
Ramialison, Mirana ;
Birney, Ewan ;
Wittbrodt, Joachim .
NATURE METHODS, 2007, 4 (07) :563-565
[7]   Detection of functional DNA motifs via statistical over-representation [J].
Frith, MC ;
Fu, YT ;
Yu, LQ ;
Chen, JF ;
Hansen, U ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2004, 32 (04) :1372-1381
[8]   Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project [J].
Gerstein, Mark B. ;
Lu, Zhi John ;
Van Nostrand, Eric L. ;
Cheng, Chao ;
Arshinoff, Bradley I. ;
Liu, Tao ;
Yip, Kevin Y. ;
Robilotto, Rebecca ;
Rechtsteiner, Andreas ;
Ikegami, Kohta ;
Alves, Pedro ;
Chateigner, Aurelien ;
Perry, Marc ;
Morris, Mitzi ;
Auerbach, Raymond K. ;
Feng, Xin ;
Leng, Jing ;
Vielle, Anne ;
Niu, Wei ;
Rhrissorrakrai, Kahn ;
Agarwal, Ashish ;
Alexander, Roger P. ;
Barber, Galt ;
Brdlik, Cathleen M. ;
Brennan, Jennifer ;
Brouillet, Jeremy Jean ;
Carr, Adrian ;
Cheung, Ming-Sin ;
Clawson, Hiram ;
Contrino, Sergio ;
Dannenberg, Luke O. ;
Dernburg, Abby F. ;
Desai, Arshad ;
Dick, Lindsay ;
Dose, Andrea C. ;
Du, Jiang ;
Egelhofer, Thea ;
Ercan, Sevinc ;
Euskirchen, Ghia ;
Ewing, Brent ;
Feingold, Elise A. ;
Gassmann, Reto ;
Good, Peter J. ;
Green, Phil ;
Gullier, Francois ;
Gutwein, Michelle ;
Guyer, Mark S. ;
Habegger, Lukas ;
Han, Ting ;
Henikoff, Jorja G. .
SCIENCE, 2010, 330 (6012) :1775-1787
[9]   A Biophysical Model for Analysis of Transcription Factor Interaction and Binding Site Arrangement from Genome-Wide Binding Data [J].
He, Xin ;
Chen, Chieh-Chun ;
Hong, Feng ;
Fang, Fang ;
Sinha, Saurabh ;
Ng, Huck-Hui ;
Zhong, Sheng .
PLOS ONE, 2009, 4 (12)
[10]   Crystal structure and DNA binding of the homeodomain of the stem cell transcription factor Nanog [J].
Jauch, Ralf ;
Ng, Calista Keow Leng ;
Saikatendu, Kumar Singh ;
Stevens, Raymond G. ;
Kolatkar, Prasanna R. .
JOURNAL OF MOLECULAR BIOLOGY, 2008, 376 (03) :758-770