Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites

被引:75
作者
Chen, Zhen [1 ]
He, Ningning [1 ]
Huang, Yu [2 ]
Qin, Wen Tao [3 ]
Liu, Xuhan [4 ]
Li, Lei [1 ,2 ,5 ]
机构
[1] Qingdao Univ, Sch Basic Med, Qingdao 266021, Peoples R China
[2] Qingdao Univ, Sch Data Sci & Software Engn, Qingdao 266021, Peoples R China
[3] Univ Western Ontario, Schulich Sch Med & Dent, Dept Biochem, London, ON N6A 5C1, Canada
[4] Beijing Oriental Yamei Gene Technol Inst Co Ltd, Dept Informat Technol, Beijing 100078, Peoples R China
[5] Qingdao Univ, Qingdao Canc Inst, Qingdao 266021, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Recurrent neural network; LSTM; Malonylation; Random forest; LYSINE MALONYLATION; UBIQUITINATION SITES; NEURAL-NETWORKS; PROTEIN; SUCCINYLATION; SETS;
D O I
10.1016/j.gpb.2018.08.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning (DL) network classifier based on long short-term memory (LSTM) with word embedding (LSTMWE) for the prediction of mammalian malonylation sites. LSTMWE performs better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTM(WE )is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning (ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTMWE and the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence. LEMP is available at http://www.bioinfogo.org/lemp.
引用
收藏
页码:451 / 459
页数:9
相关论文
共 44 条
[1]   A Chemical Probe for Lysine Malonylation [J].
Bao, Xiucong ;
Zhao, Qian ;
Yang, Tangpo ;
Fung, Yi Man Eva ;
Li, Xiang David .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2013, 52 (18) :4883-4886
[2]   Oncogenic kinase signalling [J].
Blume-Jensen, P ;
Hunter, T .
NATURE, 2001, 411 (6835) :355-365
[3]   Deep Learning and Its Applications in Biomedicine [J].
Cao, Chensi ;
Liu, Feng ;
Tan, Hai ;
Song, Deshou ;
Shu, Wenjie ;
Li, Weizhong ;
Zhou, Yiming ;
Bo, Xiaochen ;
Xie, Zhi .
GENOMICS PROTEOMICS & BIOINFORMATICS, 2018, 16 (01) :17-32
[4]   Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs [J].
Chen, Ke ;
Kurgan, Lukasz A. ;
Ruan, Jishou .
BMC STRUCTURAL BIOLOGY, 2007, 7
[5]   Prediction of protein crystallization using collocation of amino acid pairs [J].
Chen, Ke ;
Kurgan, Lukasz ;
Rahbari, Mandana .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2007, 355 (03) :764-769
[6]   Prediction of protein-protein interactions using random decision forest framework [J].
Chen, XW ;
Liu, M .
BIOINFORMATICS, 2005, 21 (24) :4394-4400
[7]   SUMOhydro: A Novel Method for the Prediction of Sumoylation Sites Based on Hydrophobic Properties [J].
Chen, Yong-Zi ;
Chen, Zhen ;
Gong, Yu-Ai ;
Ying, Guoguang .
PLOS ONE, 2012, 7 (06)
[8]   Toward an Understanding of the Molecular Mechanisms of Barnacle Larval Settlement: A Comparative Transcriptomic Approach [J].
Chen, Zhang-Fan ;
Matsumura, Kiyotaka ;
Wang, Hao ;
Arellano, Shawn M. ;
Yan, Xingcheng ;
Alam, Intikhab ;
Archer, John A. C. ;
Bajic, Vladimir B. ;
Qian, Pei-Yuan .
PLOS ONE, 2011, 6 (07)
[9]   Towards more accurate prediction of ubiquitination sites: a comprehensive review of current methods, tools and features [J].
Chen, Zhen ;
Zhou, Yuan ;
Zhang, Ziding ;
Song, Jiangning .
BRIEFINGS IN BIOINFORMATICS, 2015, 16 (04) :640-657
[10]   hCKSAAP_UbSite: Improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties [J].
Chen, Zhen ;
Zhou, Yuan ;
Song, Jiangning ;
Zhang, Ziding .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2013, 1834 (08) :1461-1467