Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features

被引:24
|
作者
Du, Yipeng [1 ]
Zhai, Zichao [4 ]
Li, Ying [4 ]
Lu, Ming [4 ]
Cai, Tanxi [2 ,3 ]
Zhou, Bo [1 ,6 ]
Huang, Lei [7 ]
Wei, Taotao [1 ]
Li, Tingting [4 ,5 ]
机构
[1] Chinese Acad Sci, Natl Lab Biomacromol, Inst Biophys, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Lab Prot & Peptide Pharmaceut, Inst Biophys, Beijing 100101, Peoples R China
[3] Chinese Acad Sci, Lab Prote, Inst Biophys, Beijing 100101, Peoples R China
[4] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Dept Biomed Informat, Beijing 100191, Peoples R China
[5] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Inst Syst Biomed, Beijing 100191, Peoples R China
[6] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[7] Ocean Univ China, Coll Informat Sci & Engn, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
acylation; acetylation; malonylation; succinylation; glutarylation; prediction; proteomics; posttranslational modification; PTM; ACETYLATION SITES; SUCCINYLATION; MALONYLATION; IDENTIFICATION;
D O I
10.1021/acs.jproteome.6b00240
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Liquid chromatography tandem mass spectrometry (LC-MS/MS)based proteomic methods have been widely used to identify lysine acylation proteins. However, these experimental approaches often fail to detect proteins that are in low abundance or absent in specific biological samples. To circumvent these problems, we developed a computational method to predict lysine acylation, including acetylation, malonylation, succinylation, and glutarylation. The prediction algorithm integrated flanking primary sequence determinants and evolutionary conservation of acylated lysine as well as multiple protein functional annotation features including gene ontology, conserved domains, and protein protein interactions. The inclusion of functional annotation features increases predictive power oversimple sequence considerations for four of the acylation species evaluated. For example, the Matthews correlation coefficient (MCC) for the prediction of malonylation increased from 0.26 to 0.73. The performance of prediction was validated against an independent data set for malonylation. Likewise, when tested with independent data sets, the algorithm displayed improved sensitivity and specificity over existing methods. Experimental validation by Western blot experiments and LC-MS/ MS detection further attested to the performance of prediction. We then applied our algorithm on to the mouse proteome and reported the global-scale prediction of lysine acetylation, malonylation, succinylation, and glutarylation, which should serve as a valuable resource for future functional studies.
引用
收藏
页码:4234 / 4244
页数:11
相关论文
共 50 条
  • [1] Integrating protein secondary structure prediction and multiple sequence alignment
    Simossis, VA
    Heringa, J
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2004, 5 (04) : 249 - 266
  • [2] Current computational tools for protein lysine acylation site prediction
    Qin, Zhaohui
    Ren, Haoran
    Zhao, Pei
    Wang, Kaiyuan
    Liu, Huixia
    Miao, Chunbo
    Du, Yanxiu
    Li, Junzhou
    Wu, Liuji
    Chen, Zhen
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [3] PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
    Nilamyani, Andi Nur
    Auliah, Firda Nurul
    Moni, Mohammad Ali
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (05) : 1 - 11
  • [4] PLMLA: prediction of lysine methylation and lysine acetylation by combining multiple features
    Shi, Shao-Ping
    Qiu, Jian-Ding
    Sun, Xing-Yu
    Suo, Sheng-Bao
    Huang, Shu-Yun
    Liang, Ru-Ping
    MOLECULAR BIOSYSTEMS, 2012, 8 (05) : 1520 - 1527
  • [5] Position-Specific Analysis and Prediction for Protein Lysine Acetylation Based on Multiple Features
    Suo, Sheng-Bao
    Qiu, Jian-Ding
    Shi, Shao-Ping
    Sun, Xing-Yu
    Huang, Shu-Yun
    Chen, Xiang
    Liang, Ru-Ping
    PLOS ONE, 2012, 7 (11):
  • [6] Enhanced prediction of protein functional identity through the integration of sequence and structural features
    Fujita, Suguru
    Terada, Tohru
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 4124 - 4130
  • [7] Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning
    Adhikari, Badri
    Hou, Jie
    Cheng, Jianlin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 84 - 96
  • [8] PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations
    Auliah, Firda Nurul
    Nilamyani, Andi Nur
    Shoombuatong, Watshara
    Alam, Md Ashad
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (04) : 1 - 12
  • [9] Integrating multiple sequence features for identifying anticancer peptides
    Zou, Hongliang
    Yang, Fan
    Yin, Zhijian
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 99
  • [10] Gleaning structural and functional information from correlations in protein multiple sequence alignments
    Neuwald, Andrew F.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2016, 38 : 1 - 8