Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites

被引:0
|
作者
Zhen Chen [1 ]
Ningning He [1 ]
Yu Huang [2 ]
Wen Tao Qin [3 ]
Xuhan Liu [4 ]
Lei Li [1 ,2 ,5 ]
机构
[1] School of Basic Medicine,Qingdao University
[2] School of Data Science and Software Engineering,Qingdao University
[3] Department of Biochemistry,Schulich School of Medicine and Dentistry,University of Western Ontario
[4] Department of Information Technology,Beijing Oriental Yamei Gene Technology Institute Co.Ltd.
[5] Qingdao Cancer Institute,Qingdao University
基金
中国国家自然科学基金;
关键词
Deep learning; Recurrent neural network; LSTM; Malonylation; Random forest;
D O I
暂无
中图分类号
Q811.4 [生物信息论]; TP18 [人工智能理论];
学科分类号
0711 ; 081104 ; 0812 ; 0831 ; 0835 ; 1405 ;
摘要
As a newly-identified protein post-translational modification, malonylation is involved in a variety of biological functions. Recognizing malonylation sites in substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein malonylation. In this study, we constructed a deep learning(DL) network classifier based on long short-term memory(LSTM) with word embedding(LSTMWE) for the prediction of mammalian malonylation sites.LSTMWEperforms better than traditional classifiers developed with common pre-defined feature encodings or a DL classifier based on LSTM with a one-hot vector. The performance of LSTMWE is sensitive to the size of the training set, but this limitation can be overcome by integration with a traditional machine learning(ML) classifier. Accordingly, an integrated approach called LEMP was developed, which includes LSTMWEand the random forest classifier with a novel encoding of enhanced amino acid content. LEMP performs not only better than the individual classifiers but also superior to the currently-available malonylation predictors. Additionally, it demonstrates a promising performance with a low false positive rate, which is highly useful in the prediction application. Overall, LEMP is a useful tool for easily identifying malonylation sites with high confidence.LEMP is available at http://www.bioinfogo.org/lemp.
引用
收藏
页码:451 / 459
页数:9
相关论文
共 50 条
  • [1] Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites
    Chen, Zhen
    He, Ningning
    Huang, Yu
    Qin, Wen Tao
    Liu, Xuhan
    Li, Lei
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2018, 16 (06) : 451 - 459
  • [2] Integration of A Deep Learning Classifier with A Random Forest Approach for Predicting Malonylation Sites
    Zhen Chen
    Ningning He
    Yu Huang
    Wen Tao Qin
    Xuhan Liu
    Lei Li
    Genomics,Proteomics & Bioinformatics, 2018, (06) : 451 - 459
  • [3] BERMP: a cross-species classifier for predicting m6A sites by integrating a deep learning algorithm and a random forest approach
    Huang, Yu
    He, Ningning
    Chen, Yu
    Chen, Zhen
    Li, Lei
    INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES, 2018, 14 (12): : 1669 - 1677
  • [4] RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites
    AL-barakati, Hussam
    Thapa, Niraj
    Hiroto, Saigo
    Roy, Kaushik
    Newman, Robert H.
    Kc, Dukka
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 (18): : 852 - 860
  • [5] Computational Method for Identifying Malonylation Sites by Using Random Forest Algorithm
    Wang, ShaoPeng
    Li, JiaRui
    Sun, Xijun
    Zhang, Yu-Hang
    Huang, Tao
    Cai, Yu-Dong
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2020, 23 (04) : 304 - 312
  • [6] Efficient Learning of Random Forest Classifier using Disjoint Partitioning Approach
    Kulkarni, Vrushali Y.
    Sinha, Pradeep K.
    WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL II, 2013, : 826 - +
  • [7] A hybrid approach for Bangla sign language recognition using deep transfer learning model with random forest classifier
    Das, Sunanda
    Imtiaz, Md. Samir
    Neom, Nieb Hasan
    Siddique, Nazmul
    Wang, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [8] A Novel Spatial Feature For Predicting Lysine Malonylation Sites Using Machine Learning
    Liu, Yuan
    Yan, Changhui
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 76 - 79
  • [9] Sequential Boosting for Learning a Random Forest Classifier
    Baumann, Florian
    Ehlers, Arne
    Rosenhahn, Bodo
    Liu, Wei
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 442 - 447
  • [10] Purchasing Intentions Analysis of Hybrid Cars Using Random Forest Classifier and Deep Learning
    Ong, Ardvin Kester S.
    Cordova, Lara Nicole Z.
    Longanilla, Franscine Althea B.
    Caprecho, Neallo L.
    Javier, Rocksel Andry V.
    Borres, Rianina D.
    German, Josephine D.
    WORLD ELECTRIC VEHICLE JOURNAL, 2023, 14 (08):