Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features

被引:24
|
作者
Du, Yipeng [1 ]
Zhai, Zichao [4 ]
Li, Ying [4 ]
Lu, Ming [4 ]
Cai, Tanxi [2 ,3 ]
Zhou, Bo [1 ,6 ]
Huang, Lei [7 ]
Wei, Taotao [1 ]
Li, Tingting [4 ,5 ]
机构
[1] Chinese Acad Sci, Natl Lab Biomacromol, Inst Biophys, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Lab Prot & Peptide Pharmaceut, Inst Biophys, Beijing 100101, Peoples R China
[3] Chinese Acad Sci, Lab Prote, Inst Biophys, Beijing 100101, Peoples R China
[4] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Dept Biomed Informat, Beijing 100191, Peoples R China
[5] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Inst Syst Biomed, Beijing 100191, Peoples R China
[6] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[7] Ocean Univ China, Coll Informat Sci & Engn, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
acylation; acetylation; malonylation; succinylation; glutarylation; prediction; proteomics; posttranslational modification; PTM; ACETYLATION SITES; SUCCINYLATION; MALONYLATION; IDENTIFICATION;
D O I
10.1021/acs.jproteome.6b00240
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Liquid chromatography tandem mass spectrometry (LC-MS/MS)based proteomic methods have been widely used to identify lysine acylation proteins. However, these experimental approaches often fail to detect proteins that are in low abundance or absent in specific biological samples. To circumvent these problems, we developed a computational method to predict lysine acylation, including acetylation, malonylation, succinylation, and glutarylation. The prediction algorithm integrated flanking primary sequence determinants and evolutionary conservation of acylated lysine as well as multiple protein functional annotation features including gene ontology, conserved domains, and protein protein interactions. The inclusion of functional annotation features increases predictive power oversimple sequence considerations for four of the acylation species evaluated. For example, the Matthews correlation coefficient (MCC) for the prediction of malonylation increased from 0.26 to 0.73. The performance of prediction was validated against an independent data set for malonylation. Likewise, when tested with independent data sets, the algorithm displayed improved sensitivity and specificity over existing methods. Experimental validation by Western blot experiments and LC-MS/ MS detection further attested to the performance of prediction. We then applied our algorithm on to the mouse proteome and reported the global-scale prediction of lysine acetylation, malonylation, succinylation, and glutarylation, which should serve as a valuable resource for future functional studies.
引用
收藏
页码:4234 / 4244
页数:11
相关论文
共 50 条
  • [41] Complementing sequence-derived features with structural information extracted from fragment libraries for protein structure prediction
    Siyuan Liu
    Tong Wang
    Qijiang Xu
    Bin Shao
    Jian Yin
    Tie-Yan Liu
    BMC Bioinformatics, 22
  • [42] Improving Protein Structural Class Prediction Using Novel Combined Sequence Information and Predicted Secondary Structural Features
    Dai, Qi
    Wu, Li
    Li, Lihua
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2011, 32 (16) : 3393 - 3398
  • [43] NetTurnP - Neural Network Prediction of Beta-turns by Use of Evolutionary Information and Predicted Protein Sequence Features
    Petersen, Bent
    Lundegaard, Claus
    Petersen, Thomas Nordahl
    PLOS ONE, 2010, 5 (11):
  • [44] FLEXc: protein flexibility prediction using context-based statistics, predicted structural features, and sequence information
    Ashraf Yaseen
    Mais Nijim
    Brandon Williams
    Lei Qian
    Min Li
    Jianxin Wang
    Yaohang Li
    BMC Bioinformatics, 17
  • [45] Key protein identification by integrating protein complex information and multi-biological features
    Han, Yongyin
    Liu, Maolin
    Wang, Zhixiao
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (10) : 18191 - 18206
  • [46] Granular multiple kernel learning for identifying RNA-binding protein residues via integrating sequence and structure information
    Yang, Chao
    Ding, Yijie
    Meng, Qiaozhen
    Tang, Jijun
    Guo, Fei
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (17): : 11387 - 11399
  • [47] Granular multiple kernel learning for identifying RNA-binding protein residues via integrating sequence and structure information
    Chao Yang
    Yijie Ding
    Qiaozhen Meng
    Jijun Tang
    Fei Guo
    Neural Computing and Applications, 2021, 33 : 11387 - 11399
  • [48] Multiple Profile Models Extract Features from Protein Sequence Data and Resolve Functional Diversity of Very Different Protein Families
    Vicedomini, R.
    Bouly, J. P.
    Laine, E.
    Falciatore, A.
    Carbone, A.
    MOLECULAR BIOLOGY AND EVOLUTION, 2022, 39 (04)
  • [49] Analysis and Prediction of Translation Rate Based on Sequence and Functional Features of the mRNA
    Huang, Tao
    Wan, Sibao
    Xu, Zhongping
    Zheng, Yufang
    Feng, Kai-Yan
    Li, Hai-Peng
    Kong, Xiangyin
    Cai, Yu-Dong
    PLOS ONE, 2011, 6 (01):
  • [50] Alignment of multiple protein structures based on sequence and structure features
    Madhusudhan, M. S.
    Webb, Benjamin M.
    Marti-Renom, Marc A.
    Eswar, Narayanan
    Sali, Andrej
    PROTEIN ENGINEERING DESIGN & SELECTION, 2009, 22 (09): : 569 - 574