MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data
被引:5
|
作者:
Ozaki, Haruka
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
RIKEN, Adv Ctr Comp & Commun, Bioinformat Res Unit, 2-1 Hirosawa, Wako, Saitama 3510198, JapanUniv Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
Ozaki, Haruka
[1
,4
]
Iwasaki, Wataru
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
Univ Tokyo, Grad Sch Sci, Dept Biol Sci, Bunkyo Ku, Hongo 7-3-1, Tokyo 1130032, Japan
Univ Tokyo, Atmosphere & Ocean Res Inst, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778564, JapanUniv Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
Iwasaki, Wataru
[1
,2
,3
]
机构:
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778568, Japan
[2] Univ Tokyo, Grad Sch Sci, Dept Biol Sci, Bunkyo Ku, Hongo 7-3-1, Tokyo 1130032, Japan
[3] Univ Tokyo, Atmosphere & Ocean Res Inst, Kashiwanoha 5-1-5, Kashiwa, Chiba 2778564, Japan
[4] RIKEN, Adv Ctr Comp & Commun, Bioinformat Res Unit, 2-1 Hirosawa, Wako, Saitama 3510198, Japan
Background: As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Results: Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. Conclusions: By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. (C) 2016 Elsevier Ltd. All rights reserved.
机构:
College of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, ChinaCollege of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, China
Zhang, Li
Hu, Yuansen
论文数: 0引用数: 0
h-index: 0
机构:
College of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, ChinaCollege of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, China
Hu, Yuansen
Wang, Jinshui
论文数: 0引用数: 0
h-index: 0
机构:
College of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, ChinaCollege of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, China
Wang, Jinshui
Zhang, Guangle
论文数: 0引用数: 0
h-index: 0
机构:
College of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, ChinaCollege of Biological Engineering, Henan University of Technology, Zhengzhou, Henan, China