Predicting enhancers in mammalian genomes using supervised hidden Markov models

被引:10
作者
Zehnder, Tobias [1 ]
Benner, Philipp [1 ]
Vingron, Martin [1 ]
机构
[1] Max Planck Inst Mol Genet, Ihnestr 63-73, D-14195 Berlin, Germany
关键词
Enhancer prediction; Epigenetics; Gene regulation; Supervised hidden Markov models; CHROMATIN-STRUCTURE; DNA METHYLATION; ELEMENTS; REVEALS; WIDE; TRANSCRIPTION; ANNOTATION; EXPRESSION; PROMOTERS; DISCOVERY;
D O I
10.1186/s12859-019-2708-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundEukaryotic gene regulation is a complex process comprising the dynamic interaction of enhancers and promoters in order to activate gene expression. In recent years, research in regulatory genomics has contributed to a better understanding of the characteristics of promoter elements and for most sequenced model organism genomes there exist comprehensive and reliable promoter annotations. For enhancers, however, a reliable description of their characteristics and location has so far proven to be elusive. With the development of high-throughput methods such as ChIP-seq, large amounts of data about epigenetic conditions have become available, and many existing methods use the information on chromatin accessibility or histone modifications to train classifiers in order to segment the genome into functional groups such as enhancers and promoters. However, these methods often do not consider prior biological knowledge about enhancers such as their diverse lengths or molecular structure.ResultsWe developed enhancer HMM (eHMM), a supervised hidden Markov model designed to learn the molecular structure of promoters and enhancers. Both consist of a central stretch of accessible DNA flanked by nucleosomes with distinct histone modification patterns. We evaluated the performance of eHMM within and across cell types and developmental stages and found that eHMM successfully predicts enhancers with high precision and recall comparable to state-of-the-art methods, and consistently outperforms those in terms of accuracy and resolution.ConclusionseHMM predicts active enhancers based on data from chromatin accessibility assays and a minimal set of histone modification ChIP-seq experiments. In comparison to other 'black box' methods its parameters are easy to interpret. eHMM can be used as a stand-alone tool for enhancer prediction without the need for additional training or a tuning of parameters. The high spatial precision of enhancer predictions gives valuable targets for potential knockout experiments or downstream analyses such as motif search.
引用
收藏
页数:12
相关论文
共 59 条
[1]  
Alberts B., 2014, Molecular Biology of the Cell: Sixth International Student Edition
[2]   An atlas of active enhancers across human cell types and tissues [J].
Andersson, Robin ;
Gebhard, Claudia ;
Miguel-Escalada, Irene ;
Hoof, Ilka ;
Bornholdt, Jette ;
Boyd, Mette ;
Chen, Yun ;
Zhao, Xiaobei ;
Schmidl, Christian ;
Suzuki, Takahiro ;
Ntini, Evgenia ;
Arner, Erik ;
Valen, Eivind ;
Li, Kang ;
Schwarzfischer, Lucia ;
Glatz, Dagmar ;
Raithel, Johanna ;
Lilje, Berit ;
Rapin, Nicolas ;
Bagger, Frederik Otzen ;
Jorgensen, Mette ;
Andersen, Peter Refsing ;
Bertin, Nicolas ;
Rackham, Owen ;
Burroughs, A. Maxwell ;
Baillie, J. Kenneth ;
Ishizu, Yuri ;
Shimizu, Yuri ;
Furuhata, Erina ;
Maeda, Shiori ;
Negishi, Yutaka ;
Mungall, Christopher J. ;
Meehan, Terrence F. ;
Lassmann, Timo ;
Itoh, Masayoshi ;
Kawaji, Hideya ;
Kondo, Naoto ;
Kawai, Jun ;
Lennartsson, Andreas ;
Daub, Carsten O. ;
Heutink, Peter ;
Hume, David A. ;
Jensen, Torben Heick ;
Suzuki, Harukazu ;
Hayashizaki, Yoshihide ;
Mueller, Ferenc ;
Forrest, Alistair R. R. ;
Carninci, Piero ;
Rehli, Michael ;
Sandelin, Albin .
NATURE, 2014, 507 (7493) :455-+
[3]   Unmasking risk loci: DNA methylation illuminates the biology of cancer predisposition Analyzing DNA methylation of transcriptional enhancers reveals missed regulatory links between cancer risk loci and genes [J].
Aran, Dvir ;
Hellman, Asaf .
BIOESSAYS, 2014, 36 (02) :184-190
[4]  
Barrett T, 2013, DATABASE, VJ41, pD991
[5]   ChIP-Seq identification of weakly conserved heart enhancers [J].
Blow, Matthew J. ;
McCulley, David J. ;
Li, Zirong ;
Zhang, Tao ;
Akiyama, Jennifer A. ;
Holt, Amy ;
Plajzer-Frick, Ingrid ;
Shoukry, Malak ;
Wright, Crystal ;
Chen, Feng ;
Afzal, Veena ;
Bristow, James ;
Ren, Bing ;
Black, Brian L. ;
Rubin, Edward M. ;
Visel, Axel ;
Pennacchio, Len A. .
NATURE GENETICS, 2010, 42 (09) :806-U107
[6]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[7]  
Buenrostro JD, 2013, NAT METHODS, V10, P1213, DOI [10.1038/NMETH.2688, 10.1038/nmeth.2688]
[8]   Modification of Enhancer Chromatin: What, How, and Why? [J].
Calo, Eliezer ;
Wysocka, Joanna .
MOLECULAR CELL, 2013, 49 (05) :825-837
[9]  
Chan HM, 2001, J CELL SCI, V114, P2363
[10]   Integration of external signaling pathways with the core transcriptional network in embryonic stem cells [J].
Chen, Xi ;
Xu, Han ;
Yuan, Ping ;
Fang, Fang ;
Huss, Mikael ;
Vega, Vinsensius B. ;
Wong, Eleanor ;
Orlov, Yuriy L. ;
Zhang, Weiwei ;
Jiang, Jianming ;
Loh, Yuin-Han ;
Yeo, Hock Chuan ;
Yeo, Zhen Xuan ;
Narang, Vipin ;
Govindarajan, Kunde Ramamoorthy ;
Leong, Bernard ;
Shahab, Atif ;
Ruan, Yijun ;
Bourque, Guillaume ;
Sung, Wing-Kin ;
Clarke, Neil D. ;
Wei, Chia-Lin ;
Ng, Huck-Hui .
CELL, 2008, 133 (06) :1106-1117