Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites

被引:76
作者
Chen, Zhen [1 ]
He, Ningning [1 ]
Huang, Yu [2 ]
Qin, Wen Tao [3 ]
Liu, Xuhan [4 ]
Li, Lei [1 ,2 ,5 ]
机构
[1] Qingdao Univ, Sch Basic Med, Qingdao 266021, Peoples R China
[2] Qingdao Univ, Sch Data Sci & Software Engn, Qingdao 266021, Peoples R China
[3] Univ Western Ontario, Schulich Sch Med & Dent, Dept Biochem, London, ON N6A 5C1, Canada
[4] Beijing Oriental Yamei Gene Technol Inst Co Ltd, Dept Informat Technol, Beijing 100078, Peoples R China
[5] Qingdao Univ, Qingdao Canc Inst, Qingdao 266021, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Recurrent neural network; LSTM; Malonylation; Random forest; LYSINE MALONYLATION; UBIQUITINATION SITES; NEURAL-NETWORKS; PROTEIN; SUCCINYLATION; SETS;
D O I
10.1016/j.gpb.2018.08.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning (DL) network classifier based on long short-term memory (LSTM) with word embedding (LSTMWE) for the prediction of mammalian malonylation sites. LSTMWE performs better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTM(WE )is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning (ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTMWE and the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence. LEMP is available at http://www.bioinfogo.org/lemp.
引用
收藏
页码:451 / 459
页数:9
相关论文
共 44 条
[31]   Prediction of Protein-Protein Interaction Sites in Sequences and 3D Structures by Random Forests [J].
Sikic, Mile ;
Tomic, Sanja ;
Vlahovicek, Kristian .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
[32]  
Srivastava N, 2014, J MACH LEARN RES, V15, P1929
[33]   Two Sample Logo: a graphical representation of the differences between two sets of sequence alignments [J].
Vacic, Vladimir ;
Iakoucheva, Lilia M. ;
Radivojac, Predrag .
BIOINFORMATICS, 2006, 22 (12) :1536-1537
[34]   Prediction of post-translational modification sites using multiple kernel support vector machine [J].
Wang, BingHua ;
Wang, Minghui ;
Li, Ao .
PEERJ, 2017, 5
[35]   MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction [J].
Wang, Duolin ;
Zeng, Shuai ;
Xu, Chunhui ;
Qiu, Wangren ;
Liang, Yanchun ;
Joshi, Trupti ;
Xu, Dong .
BIOINFORMATICS, 2017, 33 (24) :3909-3916
[36]   Computational prediction of species-specific malonylation sites via enhanced characteristic strategy [J].
Wang, Li-Na ;
Shi, Shao-Ping ;
Xu, Hao-Dong ;
Wen, Ping-Ping ;
Qiu, Jian-Ding .
BIOINFORMATICS, 2017, 33 (10) :1457-1463
[37]   A novel method for predicting post-translational modifications on serine and threonine sites by using site-modification network profiles [J].
Wang, Minghui ;
Jiang, Yujie ;
Xu, Xiaoyi .
MOLECULAR BIOSYSTEMS, 2015, 11 (11) :3092-3100
[38]   Predicting Residue-Residue Contacts and Helix-Helix Interactions in Transmembrane Proteins Using an Integrative Feature-Based Random Forest Approach [J].
Wang, Xiao-Feng ;
Chen, Zhen ;
Wang, Chuan ;
Yan, Ren-Xiang ;
Zhang, Ziding ;
Song, Jiangning .
PLOS ONE, 2011, 6 (10)
[39]   DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning [J].
Xie, Yubin ;
Luo, Xiaotong ;
Li, Yupeng ;
Chen, Li ;
Ma, Wenbin ;
Huang, Junjiu ;
Cui, Jun ;
Zhao, Yong ;
Xue, Yu ;
Zuo, Zhixiang ;
Ren, Jian .
GENOMICS PROTEOMICS & BIOINFORMATICS, 2018, 16 (04) :294-306
[40]   Lysine Succinylation and Lysine Malonylation in Histones [J].
Xie, Zhongyu ;
Dai, Junbiao ;
Dai, Lunzhi ;
Tan, Minjia ;
Cheng, Zhongyi ;
Wu, Yeming ;
Boeke, Jef D. ;
Zhao, Yingming .
MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (05) :100-107