A clustering approach for identification of enriched domains from histone modification ChIP-Seq data

被引:766
|
作者
Zang, Chongzhi [1 ]
Schones, Dustin E. [2 ]
Zeng, Chen [1 ]
Cui, Kairong [2 ]
Zhao, Keji [2 ]
Peng, Weiqun [1 ]
机构
[1] George Washington Univ, Dept Phys, Washington, DC 20052 USA
[2] NHLBI, Lab Mol Immunol, NIH, Bethesda, MD 20892 USA
基金
美国国家科学基金会;
关键词
TRANSCRIPTION FACTOR-BINDING; GENOME-WIDE IDENTIFICATION; EMBRYONIC STEM-CELLS; FALSE DISCOVERY RATE; LYSINE; 9; H3; PROTEINS; SITES; METHYLATIONS; RECOGNITION;
D O I
10.1093/bioinformatics/btp340
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Chromatin states are the key to gene regulation and cell identity. Chromatin immunoprecipitation (ChIP) coupled with high-throughput sequencing (ChIP-Seq) is increasingly being used to map epigenetic states across genomes of diverse species. Chromatin modi. cation profiles are frequently noisy and diffuse, spanning regions ranging from several nucleosomes to large domains of multiple genes. Much of the early work on the identification of ChIP-enriched regions for ChIP-Seq data has focused on identifying localized regions, such as transcription factor binding sites. Bioinformatic tools to identify diffuse domains of ChIP-enriched regions have been lacking. Results: Based on the biological observation that histone modi. cations tend to cluster to form domains, we present a method that identifies spatial clusters of signals unlikely to appear by chance. This method pools together enrichment information from neighboring nucleosomes to increase sensitivity and specificity. By using genomic-scale analysis, as well as the examination of loci with validated epigenetic states, we demonstrate that this method outperforms existing methods in the identification of ChIP-enriched signals for histone modi. cation profiles. We demonstrate the application of this unbiased method in important issues in ChIP-Seq data analysis, such as data normalization for quantitative comparison of levels of epigenetic modi. cations across cell types and growth conditions.
引用
收藏
页码:1952 / 1958
页数:7
相关论文
共 50 条
  • [1] An HMM approach to genome-wide identification of differential histone modification sites from ChIP-seq data
    Xu, Han
    Wei, Chia-Lin
    Lin, Feng
    Sung, Wing-Kin
    BIOINFORMATICS, 2008, 24 (20) : 2344 - 2349
  • [2] HERON: A Novel Tool Enables Identification of Long, Weakly Enriched Genomic Domains in ChIP-seq Data
    Macioszek, Anna
    Wilczynski, Bartek
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (15)
  • [3] Histone Modification ChIP-seq on Arabidopsis thaliana Plantlets
    Ramirez-Prado, Juan S.
    Latrasse, David
    Benhamed, Moussa
    BIO-PROTOCOL, 2021, 11 (21):
  • [4] Identifying dispersed epigenomic domains from ChIP-Seq data
    Song, Qiang
    Smith, Andrew D.
    BIOINFORMATICS, 2011, 27 (06) : 870 - 871
  • [5] Quantification of histone modification ChIP-seq enrichment for data mining and machine learning applications
    Hoang S.A.
    Xu X.
    Bekiranov S.
    BMC Research Notes, 4 (1)
  • [6] Quartet-based inference of cell differentiation trees from ChIP-Seq histone modification data
    Moumi, Nazifa Ahmed
    Das, Badhan
    Promi, Zarin Tasnim
    Bristy, Nishat Anjum
    Bayzid, Md Shamsuzzoha
    PLOS ONE, 2019, 14 (09):
  • [7] A New Approach for the Joint Analysis of Multiple Chip-Seq Libraries with Application to Histone Modification
    Ferguson, John P.
    Cho, Judy H.
    Zhao, Hongyu
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2012, 11 (03)
  • [8] RECOGNICER: A coarse-graining approach for identifying broad domains from ChIP-seq data
    Chongzhi Zang
    Yiren Wang
    Weiqun Peng
    Quantitative Biology, 2020, 8 (04) : 359 - 368
  • [9] A Clustering Approach for Motif Discovery in ChIP-Seq Dataset
    Sun, Chun-xiao
    Yang, Yu
    Wang, Hua
    Wang, Wen-hu
    ENTROPY, 2019, 21 (08)
  • [10] RECOGNICER: A coarse-graining approach for identifying broad domains from ChIP-seq data
    Zang, Chongzhi
    Wang, Yiren
    Peng, Weiqun
    QUANTITATIVE BIOLOGY, 2020, 8 (04) : 359 - 368