An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites

被引:2
|
作者
He, Fei [1 ,2 ]
Li, Jingyi [1 ]
Wang, Rui [1 ]
Zhao, Xiaowei [1 ]
Han, Ye [3 ]
机构
[1] Northeast Normal Univ, Sch Informat Sci & Technol, Changchun 130117, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
[3] Jilin Agr Univ, Sch Informat Technol, Changchun, Peoples R China
关键词
Protein ubiquitylation site; Protein SUMOylation site; Convolution neural network; Deep learning; Ensemble learning; UBIQUITIN;
D O I
10.1186/s12859-021-04445-5
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background Several computational tools for predicting protein Ubiquitylation and SUMOylation sites have been proposed to study their regulatory roles in gene location, gene expression, and genome replication. However, existing methods generally rely on feature engineering, and ignore the natural similarity between the two types of protein translational modification. This study is the first all-in-one deep network to predict protein Ubiquitylation and SUMOylation sites from protein sequences as well as their crosstalk sites simultaneously. Our deep learning architecture integrates several meta classifiers that apply deep neural networks to protein sequence information and physico-chemical properties, which were trained on multi-label classification mode for simultaneously identifying protein Ubiquitylation and SUMOylation as well as their crosstalk sites. Results The promising AUCs of our method on Ubiquitylation, SUMOylation and crosstalk sites achieved 0.838, 0.888, and 0.862 respectively on tenfold cross-validation. The corresponding APs reached 0.683, 0.804 and 0.552, which also validated our effectiveness. Conclusions The proposed architecture managed to classify ubiquitylated and SUMOylated lysine residues along with their crosstalk sites, and outperformed other well-known Ubiquitylation and SUMOylation site prediction tools.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] An Ensemble Deep Learning based Predictor for Simultaneously Identifying Protein Ubiquitylation and SUMOylation Sites
    Fei He
    Jingyi Li
    Rui Wang
    Xiaowei Zhao
    Ye Han
    BMC Bioinformatics, 22
  • [2] Protein Ubiquitylation and Sumoylation Site Prediction Based on Ensemble and Transfer Learning
    He, Fei
    Wang, Rui
    Gao, Yanxin
    Wang, Duolin
    Yu, Yang
    Xu, Dong
    Zhao, Xiaowei
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 117 - 123
  • [3] MTNA: A deep learning based predictor for identifying multiple types of N-terminal protein acetylated sites
    Chen, Yongbing
    Qin, Wenyuan
    Liu, Tong
    Li, Ruikun
    He, Fei
    Han, Ye
    Ma, Zhiqiang
    Ren, Zilin
    ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (09): : 5442 - 5456
  • [4] PrUb-EL: A hybrid framework based on deep learning for identifying ubiquitination sites in Arabidopsis thaliana using ensemble learning strategy
    Wang, Houqiang
    Li, Hong
    Gao, Weifeng
    Xie, Jin
    ANALYTICAL BIOCHEMISTRY, 2022, 658
  • [5] A Stacking-Based Ensemble Learning Predictor Combined with Particle Swarm Optimizer for Identifying RNA Pseudouridine Sites
    Wang, Xiao
    Li, Pengfei
    Han, Lijun
    Wang, Rong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 521 - 531
  • [6] iSUMO-RsFPN: A predictor for identifying lysine SUMOylation sites based on multi-features and feature pyramid networks
    Lv, Zhe
    Wei, Xin
    Hu, Siqin
    Lin, Gang
    Qiu, Wangren
    ANALYTICAL BIOCHEMISTRY, 2024, 687
  • [7] A deep learning based ensemble approach for protein allergen classification
    Kumar, Arun
    Rana, Prashant Singh
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [8] A deep learning based ensemble approach for protein allergen classification
    Kumar A.
    Rana P.S.
    PeerJ Computer Science, 2023, 9
  • [9] UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites
    Siraj, Arslan
    Lim, Dae Yeong
    Tayara, Hilal
    Chong, Kil To
    GENES, 2021, 12 (05)
  • [10] Identifying High Risk of Atherosclerosis Using Deep Learning and Ensemble Learning
    Olhosseiny, Hedieh Hashem
    Mirzaloo, Mohammadsalar
    Bolic, Miodrag
    Dajani, Hilmi R.
    Groza, Voicu
    Yoshida, Masayoshi
    2021 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS (IEEE MEMEA 2021), 2021,