Prediction of Protein S-Nitrosylation Sites Based on Adapted Normal Distribution Bi-Profile Bayes and Chou's Pseudo Amino Acid Composition

被引:89
|
作者
Jia, Cangzhi [1 ]
Lin, Xin [1 ]
Wang, Zhiping [1 ]
机构
[1] Dalian Maritime Univ, Dept Math, Dalian 116026, Peoples R China
关键词
S-nitrosylation; post-translational modification; bi-profile Bayes; amino acid physicochemical properties; ACCURATE PREDICTION; PSEAAC; IDENTIFICATION;
D O I
10.3390/ijms150610410
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein S-nitrosylation is a reversible post-translational modification by covalent modification on the thiol group of cysteine residues by nitric oxide. Growing evidence shows that protein S-nitrosylation plays an important role in normal cellular function as well as in various pathophysiologic conditions. Because of the inherent chemical instability of the S-NO bond and the low abundance of endogenous S-nitrosylated proteins, the unambiguous identification of S-nitrosylation sites by commonly used proteomic approaches remains challenging. Therefore, computational prediction of S-nitrosylation sites has been considered as a powerful auxiliary tool. In this work, we mainly adopted an adapted normal distribution bi-profile Bayes (ANBPB) feature extraction model to characterize the distinction of position-specific amino acids in 784 S-nitrosylated and 1568 non-S-nitrosylated peptide sequences. We developed a support vector machine prediction model, iSNO-ANBPB, by incorporating ANBPB with the Chou's pseudo amino acid composition. In jackknife cross-validation experiments, iSNO-ANBPB yielded an accuracy of 65.39% and a Matthew's correlation coefficient (MCC) of 0.3014. When tested on an independent dataset, iSNO-ANBPB achieved an accuracy of 63.41% and a MCC of 0.2984, which are much higher than the values achieved by the existing predictors SNOSite, iSNO-PseAAC, the Li et al. algorithm, and iSNO-AAPair. On another training dataset, iSNO-ANBPB also outperformed GPS-SNO and iSNO-PseAAC in the 10-fold crossvalidation test.
引用
收藏
页码:10410 / 10423
页数:14
相关论文
共 28 条
  • [21] A novel alignment-free method to classify protein folding types by combining spectral graph clustering with Chou's pseudo amino acid composition
    Tripathi, Pooja
    Pandey, Paras N.
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 424 : 49 - 54
  • [22] A Multilabel Model Based on Chou's Pseudo-Amino Acid Composition for Identifying Membrane Proteins with Both Single and Multiple Functional Types
    Huang, Chao
    Yuan, Jing-Qi
    JOURNAL OF MEMBRANE BIOLOGY, 2013, 246 (04) : 327 - 334
  • [23] Prediction subcellular localization of Gram-negative bacterial proteins by support vector machine using wavelet denoising and Chou's pseudo amino acid composition
    Yu, Bin
    Li, Shan
    Chen, Cheng
    Xu, Jiameng
    Qiu, Wenying
    Wu, Xue
    Chen, Ruixin
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 167 : 102 - 112
  • [24] Using radial basis function on the general form of Chou's pseudo amino acid composition and PSSM to predict subcellular locations of proteins with both single and multiple sites
    Huang, Chao
    Yuan, Jingqi
    BIOSYSTEMS, 2013, 113 (01) : 50 - 57
  • [25] OOgenesis_Pred: A sequence-based method for predicting oogenesis proteins by six different modes of Chou's pseudo amino acid composition
    Rahimi, Maryam
    Bakhtiarizadeh, Mohammad Reza
    Mohammadi-Sangcheshmeh, Abdollah
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 414 : 128 - 136
  • [26] Prediction of Golgi-resident protein types using general form of Chou's pseudo-amino acid compositions: Approaches with minimal redundancy maximal relevance feature selection
    Jiao, Ya-Sen
    Du, Pu-Feng
    JOURNAL OF THEORETICAL BIOLOGY, 2016, 402 : 38 - 44
  • [27] iPhosH-PseAAC: Identify Phosphohistidine Sites in Proteins by Blending Statistical Moments and Position Relative Features According to the Chou's 5-Step Rule and General Pseudo Amino Acid Composition
    Awais, Muhammad
    Hussain, Waqar
    Khan, Yaser Daanial
    Rasool, Nouman
    Khan, Sher Afzal
    Chou, Kuo-Chen
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (02) : 596 - 610
  • [28] iHyd-PseAAC (EPSV): Identifying Hydroxylation Sites in Proteins by Extracting Enhanced Position and Sequence Variant Feature via Chou's 5-Step Rule and General Pseudo Amino Acid Composition
    Ehsan, Asma
    Mahmood, Muhammad K.
    Khan, Yaser D.
    Barukab, Omar M.
    Khan, Sher A.
    Chou, Kuo-Chen
    CURRENT GENOMICS, 2019, 20 (02) : 124 - 133