MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data

Cited by: 5
Authors
Ozaki, Haruka [1, 4]
Iwasaki, Wataru [1, 2, 3]
Affiliations
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
[2] Univ Tokyo, Grad Sch Sci, Dept Biol Sci, Bunkyo Ku, Hongo 7-3-1, Tokyo 1130032, Japan
[3] Univ Tokyo, Atmosphere & Ocean Res Inst, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778564, Japan
[4] RIKEN, Adv Ctr Comp & Commun, Bioinformat Res Unit, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
Keywords
DNA binding motifs; ChIP-Seq; Transcription factors; SERUM RESPONSE FACTOR; TRANSCRIPTION-FACTOR; SEQUENCE; SITES; GENE; CREB; EXPRESSION; DISCOVERY; TRANSACTIVATION; ELEMENTS;
DOI
10.1016/j.compbiolchem.2016.01.014
Chinese Library Classification number
Q [Biological sciences];
Discipline classification code
07; 0710; 09;
Abstract
Background: As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-Seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif.

Results: Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs.

Conclusions: By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. (C) 2016 Elsevier Ltd. All rights reserved.
Pages: 62-72
Number of pages: 11