Succinylation Site Prediction Based on Protein Sequences Using the IFS-LightGBM (BO) Model

Cited by: 20
Authors
Zhang, Lu [1]
Liu, Min [1]
Qin, Xinyi [1]
Liu, Guangzhong [1]
Affiliations
[1] Shanghai Maritime Univ, Coll Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
Funding
Natural Science Foundation of Shanghai;
Keywords
LYSINE SUCCINYLATION; POSTTRANSLATIONAL MODIFICATION; UBIQUITINATION SITES; IDENTIFICATION; EXPRESSION; PATTERNS; SIRT5; TOOL;
DOI
10.1155/2020/8858489
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Succinylation is an important posttranslational modification of proteins that plays a key role in regulating protein conformation and controlling cellular function. Many studies have shown that succinylation of protein lysine residues is closely related to the occurrence of many diseases. To understand the mechanism of succinylation in depth, it is necessary to identify succinylation sites in proteins accurately. In this study, we develop a new model, IFS-LightGBM (BO), which combines the incremental feature selection (IFS) method, the LightGBM feature selection method, the Bayesian optimization algorithm, and the LightGBM classifier to predict succinylation sites in proteins. Specifically, pseudo amino acid composition (PseAAC), position-specific scoring matrix (PSSM), disorder status, and composition of k-spaced amino acid pairs (CKSAAP) are first employed to extract feature information. Then, the LightGBM feature selection method is combined with IFS to select the optimal feature subset for the LightGBM classifier. Finally, to increase prediction accuracy and reduce the computational load, the Bayesian optimization algorithm is used to optimize the parameters of the LightGBM classifier. The results reveal that the IFS-LightGBM (BO)-based prediction model performs better when evaluated by common metrics such as accuracy, recall, precision, Matthews correlation coefficient (MCC), and F-measure.
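As an illustration of one of the feature encodings named in the abstract, the following is a minimal sketch of CKSAAP in its commonly used form: for each gap k, count the ordered residue pairs separated by k positions and normalize by the number of pair positions. The function name `cksaap`, the parameter `k_max`, and the example window are assumptions made for illustration; the abstract does not state the window length or the range of k used by the authors, and this is not their implementation.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
# All 400 ordered residue pairs, in a fixed order that defines the feature layout.
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def cksaap(window: str, k_max: int = 2) -> list[float]:
    """Return a CKSAAP feature vector for a peptide window (illustrative sketch).

    For each gap k = 0..k_max, every pair (window[i], window[i + k + 1]) is
    counted and divided by the number of pair positions, len(window) - k - 1.
    The result has 400 * (k_max + 1) values. Pairs containing non-standard
    symbols (for example a padding character) are skipped.
    """
    features: list[float] = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        n_positions = len(window) - k - 1
        for i in range(max(n_positions, 0)):
            pair = window[i] + window[i + k + 1]
            if pair in counts:
                counts[pair] += 1
        denom = n_positions if n_positions > 0 else 1  # avoid division by zero
        features.extend(counts[p] / denom for p in PAIRS)
    return features


# Hypothetical usage: encode a short window centered on a lysine (K).
vector = cksaap("AKAKA", k_max=1)
print(len(vector))  # 400 features per gap value, two gap values
```

In a pipeline like the one the abstract describes, vectors of this kind would be concatenated with the other encodings (PseAAC, PSSM, disorder status) before feature ranking and selection.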
Pages: 15
Related papers
50 in total
  • [1] The Prediction of Succinylation Site in Protein by Analyzing Amino Acid Composition
    Van-Minh Bui
    Van-Nui Nguyen
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 633 - 642
  • [2] DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction
    Thapa, Niraj
    Chaudhari, Meenal
    McManus, Sean
    Roy, Kaushik
    Newman, Robert H.
    Saigo, Hiroto
    KC, Dukka B.
    BMC BIOINFORMATICS, 2020, 21 (Suppl 3)
  • [3] Detecting Succinylation sites from protein sequences using ensemble support vector machine
    Ning, Qiao
    Zhao, Xiaosa
    Bao, Lingling
    Ma, Zhiqiang
    Zhao, Xiaowei
    BMC BIOINFORMATICS, 2018, 19
  • [4] Improving protein succinylation sites prediction using embeddings from protein language model
    Pokharel, Suresh
    Pratyush, Pawel
    Heinzinger, Michael
    Newman, Robert H.
    Dukka, B. K. C.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] A protein succinylation sites prediction method based on the hybrid architecture of LSTM network and CNN
    Zhang, Die
    Wang, Shunfang
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2022, 20 (02)
  • [6] Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences
    Cai, Binghuang
    Jiang, Xia
    BMC BIOINFORMATICS, 2016, 17
  • [7] A parallel model of DenseCNN and ordered-neuron LSTM for generic and species-specific succinylation site prediction
    Wang, Huiqing
    Zhao, Hong
    Zhang, Jing
    Han, Jiale
    Liu, Zhihao
    BIOTECHNOLOGY AND BIOENGINEERING, 2022, 119 (07) : 1755 - 1767
  • [8] A Comprehensive Comparative Review of Protein Sequence-Based Computational Prediction Models of Lysine Succinylation Sites
    Tasmia, Samme Amena
    Kibria, Md. Kaderi
    Islam, Md. Ariful
    Khatun, Mst Shamima
    Mollah, Md. Nurul Haque
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2022, 23 (11) : 744 - 756
  • [9] Prediction of S-Glutathionylation Sites Based on Protein Sequences
    Sun, Chenglei
    Shi, Zheng-Zheng
    Zhou, Xiaobo
    Chen, Luonan
    Zhao, Xing-Ming
    PLOS ONE, 2013, 8 (02)
  • [10] Prediction of protein crotonylation sites through LightGBM classifier based on SMOTE and elastic net
    Liu, Yaning
    Yu, Zhaomin
    Chen, Cheng
    Han, Yu
    Yu, Bin
    ANALYTICAL BIOCHEMISTRY, 2020, 609