Comparative annotation of functional regions in the human genome using epigenomic data

Cited by: 30
Authors
Won, Kyoung-Jae [1, 2]
Zhang, Xian [1]
Wang, Tao [1]
Ding, Bo [1]
Raha, Debasish [3]
Snyder, Michael [3]
Ren, Bing [4, 5]
Wang, Wei [1, 5]
Affiliations
[1] Univ Calif San Diego, Dept Chem & Biochem, La Jolla, CA 92093 USA
[2] Univ Penn, Perelman Sch Med, Inst Diabet Obes & Metab, Dept Genet, Philadelphia, PA 19104 USA
[3] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
[4] UCSD Sch Med, Ludwig Inst Canc Res, La Jolla, CA 92093 USA
[5] UCSD Sch Med, Dept Cellular & Mol Med, La Jolla, CA 92093 USA
Funding
National Institutes of Health (USA)
Keywords
TRANSCRIPTION FACTOR-BINDING; HIDDEN-MARKOV-MODEL; PREDICTIVE CHROMATIN SIGNATURES; EMBRYONIC STEM-CELLS; GENE-EXPRESSION; HISTONE MODIFICATIONS; CHIP-SEQ; REGULATORY ELEMENTS; SPEECH RECOGNITION; DNA ELEMENTS;
DOI
10.1093/nar/gkt143
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Epigenetic regulation is dynamic and cell-type dependent. The recently available epigenomic data in multiple cell types provide an unprecedented opportunity for a comparative study of the epigenetic landscape. We developed a machine-learning method called ChroModule to annotate the epigenetic states in eight ENCyclopedia Of DNA Elements (ENCODE) cell types. The trained model successfully captured the characteristic histone-modification patterns associated with regulatory elements, such as promoters and enhancers, and outperformed state-of-the-art methods in identifying enhancers. In addition, because the number of epigenetic states in the model is fixed, ChroModule allows straightforward illustration of epigenetic variability across multiple cell types. Using this feature, we found that invariable and variable epigenetic states across cell types correspond to housekeeping functions and stimulus response, respectively. In particular, we observed that enhancers, but not the other regulatory elements, dictate cell specificity: similar cell types share common enhancers, and cell-type-specific enhancers are often bound by transcription factors that play critical roles in that cell type. We also found that some genomic regions are dormant in one cell type but primed to become active in other cell types. These observations highlight the usefulness of ChroModule in the comparative analysis and interpretation of multiple epigenomes.
Pages: 4423-4432
Page count: 10