MMDiff: quantitative testing for shape changes in ChIP-Seq data sets

被引:22
作者
Schweikert, Gabriele [1 ,2 ]
Cseke, Botond [1 ]
Clouaire, Thomas [2 ]
Bird, Adrian [2 ]
Sanguinetti, Guido [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9AB, Midlothian, Scotland
[2] Univ Edinburgh, Wellcome Trust Ctr Cell Biol, Edinburgh EH9 3JR, Midlothian, Scotland
基金
英国惠康基金; 英国生物技术与生命科学研究理事会; 欧洲研究理事会;
关键词
Chip-Seq; Differential peak detection; Kernel methods; Maximum mean discrepancy; Histone modifications; H3K4me3; Cfp1; DIFFERENTIAL EXPRESSION ANALYSIS; CXXC FINGER PROTEIN-1; CYTOSINE METHYLATION; HISTONE H3; BINDING; CHROMATIN; NORMALIZATION; SIGNATURES;
D O I
10.1186/1471-2164-14-826
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Cell-specific gene expression is controlled by epigenetic modifications and transcription factor binding. While genome-wide maps for these protein-DNA interactions have become widely available, quantitative comparison of the resulting ChIP-Seq data sets remains challenging. Current approaches to detect differentially bound or modified regions are mainly borrowed from RNA-Seq data analysis, thus focusing on total counts of fragments mapped to a region, ignoring any information encoded in the shape of the peaks. Results: Here, we present MMDiff, a robust, broadly applicable method for detecting differences between sequence count data sets. Based on quantifying shape changes in signal profiles, it overcomes challenges imposed by the highly structured nature of the data and the paucity of replicates. We first use a simulated data set to compare the performance of MMDiff with results obtained by four alternative methods. We demonstrate that MMDiff excels when peak profiles change between samples. We next use MMDiff to re-analyse a recent data set of the histone modification H3K4me3 elucidating the establishment of this prominent epigenomic marker. Our empirical analysis shows that the method yields reproducible results across experiments, and is able to detect functional important changes in histone modifications. To further explore the broader applicability of MMDiff, we apply it to two ENCODE data sets: one investigating the histone modification H3K27ac and one measuring the genome-wide binding of the transcription factor CTCF. In both cases, MMDiff proves to be complementary to count-based methods. In addition, we can show that MMDiff is capable of directly detecting changes of homotypic binding events at neighbouring binding sites. MMDiff is readily available as a Bioconductor package. Conclusions: Our results demonstrate that higher order features of ChIP-Seq peaks carry relevant and often complementary information to total counts, and hence are important in assessing differential histone modifications and transcription factor binding. We have developed a new computational method, MMDiff, that is capable of exploring these features and therefore closes an existing gap in the analysis of ChIP-Seq data sets.
引用
收藏
页数:17
相关论文
共 44 条
[1]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]   MEME SUITE: tools for motif discovery and searching [J].
Bailey, Timothy L. ;
Boden, Mikael ;
Buske, Fabian A. ;
Frith, Martin ;
Grant, Charles E. ;
Clementi, Luca ;
Ren, Jingyuan ;
Li, Wilfred W. ;
Noble, William S. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W202-W208
[3]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[4]   Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration [J].
Bauer, Sebastian ;
Grossmann, Steffen ;
Vingron, Martin ;
Robinson, Peter N. .
BIOINFORMATICS, 2008, 24 (14) :1650-1651
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   First Exon Length Controls Active Chromatin Signatures and Transcription [J].
Bieberstein, Nicole I. ;
Oesterreich, Fernando Carrillo ;
Straube, Korinna ;
Neugebauer, Karla M. .
CELL REPORTS, 2012, 2 (01) :62-68
[7]   Intra- and inter-chromosomal interactions correlate with CTCF binding genome wide [J].
Botta, Marco ;
Haider, Syed ;
Leung, Ian X. Y. ;
Lio, Pietro ;
Mozziconacci, Julien .
MOLECULAR SYSTEMS BIOLOGY, 2010, 6
[8]   DNA Methyltransferase Protein Synthesis Is Reduced in CXXC Finger Protein 1-Deficient Embryonic Stem Cells [J].
Butler, Jill S. ;
Palam, Lakshmi R. ;
Tate, Courtney M. ;
Sanford, Jeremy R. ;
Wek, Ronald C. ;
Skalnik, David G. .
DNA AND CELL BIOLOGY, 2009, 28 (05) :223-231
[9]   Reduced genomic cytosine methylation and defective cellular differentiation in embryonic stem cells lacking CpG binding protein [J].
Carlone, DL ;
Lee, JH ;
Young, SRL ;
Dobrota, E ;
Butler, JS ;
Ruiz, J ;
Skalnik, DG .
MOLECULAR AND CELLULAR BIOLOGY, 2005, 25 (12) :4881-4891
[10]   An effective statistical evaluation of ChIPseq dataset similarity [J].
Chikina, Maria D. ;
Troyanskaya, Olga G. .
BIOINFORMATICS, 2012, 28 (05) :607-613