An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites

Authors
He, Fei [1, 2]
Li, Jingyi [1]
Wang, Rui [1]
Zhao, Xiaowei [1]
Han, Ye [3]
Affiliations
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun 130117, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[3] Jilin Agr Univ, Sch Informat Technol, Changchun, Peoples R China
Keywords
Protein ubiquitylation site; Protein SUMOylation site; Convolution neural network; Deep learning; Ensemble learning; UBIQUITIN;
DOI
10.1186/s12859-021-04445-5
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Several computational tools have been proposed for predicting protein Ubiquitylation and SUMOylation sites, in order to study the regulatory roles of these modifications in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering and ignore the natural similarity between the two types of post-translational modification. This study presents the first all-in-one deep network that simultaneously predicts protein Ubiquitylation sites, SUMOylation sites, and their crosstalk sites from protein sequences. The architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties. The classifiers were trained in multi-label classification mode, so that Ubiquitylation, SUMOylation, and crosstalk sites are identified at the same time.
Results: Under tenfold cross-validation, the method achieved AUCs of 0.838, 0.888, and 0.862 on Ubiquitylation, SUMOylation, and crosstalk sites, respectively. The corresponding average precision (AP) values were 0.683, 0.804, and 0.552, further supporting the effectiveness of the method.
Conclusions: The proposed architecture classified ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools.
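The multi-label, ensemble setup described in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation. The window length (`flank`), the padding symbol `X`, the averaging rule in `soft_vote`, and the 0.5 threshold in `call_site` are all assumptions made for illustration, and the member probabilities are hand-written placeholders rather than outputs of trained networks.

```python
from typing import List, Set, Tuple

PAD = "X"  # assumed padding symbol for windows that run past the sequence ends


def lysine_windows(sequence: str, flank: int = 3) -> List[Tuple[int, str]]:
    """Return (0-based position, window) for every lysine (K) in the sequence."""
    padded = PAD * flank + sequence + PAD * flank
    windows = []
    for i, residue in enumerate(sequence):
        if residue == "K":
            # Residue i sits at index i + flank in the padded string,
            # so the slice below is centred on the lysine.
            windows.append((i, padded[i : i + 2 * flank + 1]))
    return windows


def multi_label(pos: int, ubi_sites: Set[int], sumo_sites: Set[int]) -> Tuple[int, int]:
    """Encode a site as (ubiquitylation, SUMOylation); (1, 1) marks crosstalk."""
    return (int(pos in ubi_sites), int(pos in sumo_sites))


def soft_vote(member_probs: List[Tuple[float, float]]) -> Tuple[float, float]:
    """Average per-label probabilities over ensemble members (assumed rule)."""
    n = len(member_probs)
    ubi = sum(p[0] for p in member_probs) / n
    sumo = sum(p[1] for p in member_probs) / n
    return (ubi, sumo)


def call_site(probs: Tuple[float, float], threshold: float = 0.5) -> str:
    """Turn the two label probabilities into a single site call."""
    ubi = probs[0] >= threshold
    sumo = probs[1] >= threshold
    if ubi and sumo:
        return "crosstalk"
    if ubi:
        return "ubiquitylation"
    if sumo:
        return "SUMOylation"
    return "unmodified"


if __name__ == "__main__":
    # Toy sequence and annotations, invented for this example.
    seq = "MKTAYKLE"
    for pos, window in lysine_windows(seq, flank=2):
        print(pos, window, multi_label(pos, ubi_sites={1, 5}, sumo_sites={5}))

    # Placeholder outputs of three hypothetical meta classifiers for one site.
    averaged = soft_vote([(0.9, 0.2), (0.7, 0.4), (0.8, 0.3)])
    print(averaged, call_site(averaged))
```

Encoding each lysine with two independent binary labels lets a single model cover all three site types, since a crosstalk site is simply one where both labels are positive.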
Pages: 15