Prediction of Protein Lysine Acylation by Integrating Primary Sequence Information with Multiple Functional Features

被引:24
|
作者
Du, Yipeng [1 ]
Zhai, Zichao [4 ]
Li, Ying [4 ]
Lu, Ming [4 ]
Cai, Tanxi [2 ,3 ]
Zhou, Bo [1 ,6 ]
Huang, Lei [7 ]
Wei, Taotao [1 ]
Li, Tingting [4 ,5 ]
机构
[1] Chinese Acad Sci, Natl Lab Biomacromol, Inst Biophys, Beijing 100101, Peoples R China
[2] Chinese Acad Sci, Lab Prot & Peptide Pharmaceut, Inst Biophys, Beijing 100101, Peoples R China
[3] Chinese Acad Sci, Lab Prote, Inst Biophys, Beijing 100101, Peoples R China
[4] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Dept Biomed Informat, Beijing 100191, Peoples R China
[5] Peking Univ, Sch Basic Med Sci, Hlth Sci Ctr, Inst Syst Biomed, Beijing 100191, Peoples R China
[6] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[7] Ocean Univ China, Coll Informat Sci & Engn, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
acylation; acetylation; malonylation; succinylation; glutarylation; prediction; proteomics; posttranslational modification; PTM; ACETYLATION SITES; SUCCINYLATION; MALONYLATION; IDENTIFICATION;
D O I
10.1021/acs.jproteome.6b00240
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Liquid chromatography tandem mass spectrometry (LC-MS/MS)based proteomic methods have been widely used to identify lysine acylation proteins. However, these experimental approaches often fail to detect proteins that are in low abundance or absent in specific biological samples. To circumvent these problems, we developed a computational method to predict lysine acylation, including acetylation, malonylation, succinylation, and glutarylation. The prediction algorithm integrated flanking primary sequence determinants and evolutionary conservation of acylated lysine as well as multiple protein functional annotation features including gene ontology, conserved domains, and protein protein interactions. The inclusion of functional annotation features increases predictive power oversimple sequence considerations for four of the acylation species evaluated. For example, the Matthews correlation coefficient (MCC) for the prediction of malonylation increased from 0.26 to 0.73. The performance of prediction was validated against an independent data set for malonylation. Likewise, when tested with independent data sets, the algorithm displayed improved sensitivity and specificity over existing methods. Experimental validation by Western blot experiments and LC-MS/ MS detection further attested to the performance of prediction. We then applied our algorithm on to the mouse proteome and reported the global-scale prediction of lysine acetylation, malonylation, succinylation, and glutarylation, which should serve as a valuable resource for future functional studies.
引用
收藏
页码:4234 / 4244
页数:11
相关论文
共 50 条
  • [31] Core column prediction for protein multiple sequence alignments
    DeBlasio, Dan
    Kececioglu, John
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2017, 12
  • [32] Core column prediction for protein multiple sequence alignments
    Dan DeBlasio
    John Kececioglu
    Algorithms for Molecular Biology, 12
  • [33] Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
    Yuan Li
    Mingjun Wang
    Huilin Wang
    Hao Tan
    Ziding Zhang
    Geoffrey I. Webb
    Jiangning Song
    Scientific Reports, 4
  • [34] Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
    Li, Yuan
    Wang, Mingjun
    Wang, Huilin
    Tan, Hao
    Zhang, Ziding
    Webb, Geoffrey I.
    Song, Jiangning
    SCIENTIFIC REPORTS, 2014, 4
  • [35] iLMS, Computational Identification of Lysine-Malonylation Sites by Combining Multiple Sequence Features
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    PROCEEDINGS 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2018, : 356 - 359
  • [36] Integrating sequence and graph information for enhanced drug-target affinity prediction
    Haohuai HE
    Guanxing CHEN
    Calvin Yu-Chian CHEN
    Science China(Information Sciences), 2024, 67 (02) : 325 - 326
  • [37] Prediction of protein interaction types based on sequence and network features
    Goebels, Florian
    Frishman, Dmitrij
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [38] Integrating sequence and graph information for enhanced drug-target affinity prediction
    He, Haohuai
    Chen, Guanxing
    Chen, Calvin Yu-Chian
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
  • [39] FLEXc: protein flexibility prediction using context-based statistics, predicted structural features, and sequence information
    Yaseen, Ashraf
    Nijim, Mais
    Williams, Brandon
    Qian, Lei
    Li, Min
    Wang, Jianxin
    Li, Yaohang
    BMC BIOINFORMATICS, 2016, 17
  • [40] Complementing sequence-derived features with structural information extracted from fragment libraries for protein structure prediction
    Liu, Siyuan
    Wang, Tong
    Xu, Qijiang
    Shao, Bin
    Yin, Jian
    Liu, Tie-Yan
    BMC BIOINFORMATICS, 2021, 22 (01)