Differential Gene Expression Prediction by Ensemble Deep Networks on Histone Modification Data

被引:1
作者
Huang, Zimo [1 ]
Wang, Jun [1 ,2 ]
Yan, Zhongmin [1 ]
Wan, Lin [1 ]
Guo, Maozu [3 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Shandong, Peoples R China
[2] Shandong Univ, Joint SDU NTU Ctr Artificial Intellige Res, Jinan 250101, Shandong, Peoples R China
[3] Beijing Univ Civil Engn & Architecture, Coll Elect & Informat Engn, Beijing 102616, Peoples R China
基金
中国国家自然科学基金;
关键词
Histone modification; differential expressed gene; deep neural networks; ensemble learning; feature fusion; CHROMATIN STATE; GENOME; LANGUAGE;
D O I
10.1109/TCBB.2021.3139634
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Predicting differential gene expression (DGE) from Histone modifications (HM) signal is crucial to understand how HM controls cell functional heterogeneity through influencing differential gene regulation. Most existing prediction methods use fixed-length bins to represent HM signals and transmit these bins into a single machine learning model to predict differential expression genes of single cell type or cell type pair. However, the inappropriate bin length may cause the splitting of the important HM segment and lead to information loss. Furthermore, the bias of single learning model may limit the prediction accuracy. Considering these problems, in this paper, we proposes an Ensemble deep neural networks framework for predicting Differential Gene Expression (EnDGE). EnDGE employs different feature extractors on input HM signal data with different bin lengths and fuses the feature vectors for DGE prediction. Ensemble multiple learning models with different HM signal cutting strategies helps to keep the integrity and consistency of genetic information in each signal segment, and offset the bias of individual models. Besides the popular feature extractors, we also propose a new Residual Network based model with higher prediction accuracy to increase the diversity of feature extractors. Experiments on the real datasets from the Roadmap Epigenome Project (REMC) show that for all cell type pairs, EnDGE significantly outperforms the state-of-the-art baselines for differential gene expression prediction.
引用
收藏
页码:340 / 351
页数:12
相关论文
共 49 条
[21]  
Jian YQ, 2017, SCI REP-UK, V7, DOI [10.1038/s41598-017-06153-8, 10.1038/s41598-017-06348-z]
[22]   Histone modification levels are predictive for gene expression [J].
Karlic, Rosa ;
Chung, Ho-Ryun ;
Lasserre, Julia ;
Vlahovicek, Kristian ;
Vingron, Martin .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (07) :2926-2931
[23]  
Kingma Diederik P, 2014, INT C LEARNING REPRE, V1, P15
[24]   The Next-Generation Sequencing Revolution and Its Impact on Genomics [J].
Koboldt, Daniel C. ;
Steinberg, Karyn Meltz ;
Larson, David E. ;
Wilson, Richard K. ;
Mardis, Elaine R. .
CELL, 2013, 155 (01) :27-38
[25]   The landscape of histone modifications across 1% of the human genome in five human cell lines [J].
Koch, Christoph M. ;
Andrews, Robert M. ;
Flicek, Paul ;
Dillon, Shane C. ;
Karaoz, Ulas ;
Clelland, Gayle K. ;
Wilcox, Sarah ;
Beare, David M. ;
Fowler, Joanna C. ;
Couttet, Phillippe ;
James, Keith D. ;
Lefebvre, Gregory C. ;
Bruce, Alexander W. ;
Dovey, Oliver M. ;
Ellis, Peter D. ;
Dhami, Pawandeep ;
Langford, Cordelia F. ;
Weng, Zhiping ;
Birney, Ewan ;
Carter, Nigel P. ;
Vetrie, David ;
Dunham, Ian .
GENOME RESEARCH, 2007, 17 (06) :691-707
[26]   Chromatin modifications and their function [J].
Kouzarides, Tony .
CELL, 2007, 128 (04) :693-705
[27]   Integrative analysis of 111 reference human epigenomes [J].
Kundaje, Anshul ;
Meuleman, Wouter ;
Ernst, Jason ;
Bilenky, Misha ;
Yen, Angela ;
Heravi-Moussavi, Alireza ;
Kheradpour, Pouya ;
Zhang, Zhizhuo ;
Wang, Jianrong ;
Ziller, Michael J. ;
Amin, Viren ;
Whitaker, John W. ;
Schultz, Matthew D. ;
Ward, Lucas D. ;
Sarkar, Abhishek ;
Quon, Gerald ;
Sandstrom, Richard S. ;
Eaton, Matthew L. ;
Wu, Yi-Chieh ;
Pfenning, Andreas R. ;
Wang, Xinchen ;
Claussnitzer, Melina ;
Liu, Yaping ;
Coarfa, Cristian ;
Harris, R. Alan ;
Shoresh, Noam ;
Epstein, Charles B. ;
Gjoneska, Elizabeta ;
Leung, Danny ;
Xie, Wei ;
Hawkins, R. David ;
Lister, Ryan ;
Hong, Chibo ;
Gascard, Philippe ;
Mungall, Andrew J. ;
Moore, Richard ;
Chuah, Eric ;
Tam, Angela ;
Canfield, Theresa K. ;
Hansen, R. Scott ;
Kaul, Rajinder ;
Sabo, Peter J. ;
Bansal, Mukul S. ;
Carles, Annaick ;
Dixon, Jesse R. ;
Farh, Kai-How ;
Feizi, Soheil ;
Karlic, Rosa ;
Kim, Ah-Ram ;
Kulkarni, Ashwinikumar .
NATURE, 2015, 518 (7539) :317-330
[28]   Establishing, maintaining and modifying DNA methylation patterns in plants and animals [J].
Law, Julie A. ;
Jacobsen, Steven E. .
NATURE REVIEWS GENETICS, 2010, 11 (03) :204-220
[29]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324
[30]   The Specificity and Topology of Chromatin Interaction Pathways in Yeast [J].
Lenstra, Tineke L. ;
Benschop, Joris J. ;
Kim, TaeSoo ;
Schulze, Julia M. ;
Brabers, Nathalie A. C. H. ;
Margaritis, Thanasis ;
van de Pasch, Loes A. L. ;
van Heesch, Sebastiaan A. A. C. ;
Brok, Mariel O. ;
Koerkamp, Marian J. A. Groot ;
Ko, Cheuk W. ;
van Leenen, Dik ;
Sameith, Katrin ;
van Hooff, Sander R. ;
Lijnzaad, Philip ;
Kemmeren, Patrick ;
Hentrich, Thomas ;
Kobor, Michael S. ;
Buratowski, Stephen ;
Holstege, Frank C. P. .
MOLECULAR CELL, 2011, 42 (04) :536-549