Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method

被引:7
|
作者
Gao, Jianzhao [1 ,2 ]
Tao, Xue-Wen [3 ]
Zhao, Jia [4 ]
Feng, Yuan-Ming [3 ]
Cai, Yu-Dong [5 ]
Zhang, Ning [3 ]
机构
[1] Nankai Univ, Sch Math Sci, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Tianjin Univ, Tianjin Key Lab Biomed Engn Measurement, Dept Biomed Engn, Tianjin, Peoples R China
[4] CODBIO Co Ltd, Biomed Res Ctr, Tianjin, Peoples R China
[5] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Acetylation; post-translational modification; dagging; maximum relevance minimum redundancy; incremental feature selection; epsilon lysine acetylation site; NONHISTONE PROTEINS; METHYLATION; DISORDER; SUBSTRATE; SEQUENCES; DATABASE; TARGETS; BINDING; SETS;
D O I
10.2174/1386207320666170314093216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aim and Objective: Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Material and Methods: Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Results: Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites. Conclusion: We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation.
引用
收藏
页码:629 / 637
页数:9
相关论文
共 50 条
  • [21] mChIP-KAT-MS, a method to map protein interactions and acetylation sites for lysine acetyltransferases
    Mitchell, Leslie
    Huard, Sylvain
    Cotrut, Michael
    Pourhanifeh-Lemeri, Roghayeh
    Steunou, Anne-Lise
    Hamza, Akil
    Lambert, Jean-Philippe
    Zhou, Hu
    Ning, Zhibin
    Basu, Amrita
    Cote, Jacques
    Figeys, Daniel A.
    Baetz, Kristin
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (17) : E1641 - E1650
  • [22] Prediction of Lysine Ubiquitylation with Ensemble Classifier and Feature Selection
    Zhao, Xiaowei
    Li, Xiangtao
    Ma, Zhiqiang
    Yin, Minghao
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2011, 12 (12) : 8347 - 8361
  • [23] Prediction of lysine ubiquitination with mRMR feature selection and analysis
    Cai, Yudong
    Huang, Tao
    Hu, Lele
    Shi, Xiaohe
    Xie, Lu
    Li, Yixue
    AMINO ACIDS, 2012, 42 (04) : 1387 - 1395
  • [24] Prediction of lysine ubiquitination with mRMR feature selection and analysis
    Yudong Cai
    Tao Huang
    Lele Hu
    Xiaohe Shi
    Lu Xie
    Yixue Li
    Amino Acids, 2012, 42 : 1387 - 1395
  • [25] FrankSum: New feature selection method for protein function prediction
    Al-Shahib, A
    Breitling, R
    Gilbert, D
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (04) : 259 - 275
  • [26] Position-Specific Analysis and Prediction for Protein Lysine Acetylation Based on Multiple Features
    Suo, Sheng-Bao
    Qiu, Jian-Ding
    Shi, Shao-Ping
    Sun, Xing-Yu
    Huang, Shu-Yun
    Chen, Xiang
    Liang, Ru-Ping
    PLOS ONE, 2012, 7 (11):
  • [27] Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection
    Xu, Yan
    Ding, Ya-Xin
    Ding, Jun
    Wu, Ling-Yun
    Xue, Yu
    SCIENTIFIC REPORTS, 2016, 6
  • [28] Mal-Lys: prediction of lysine malonylation sites in proteins integrated sequence-based features with mRMR feature selection
    Yan Xu
    Ya-Xin Ding
    Jun Ding
    Ling-Yun Wu
    Yu Xue
    Scientific Reports, 6
  • [29] A hybrid feature selection method for predicting lysine malonylation sites in proteins via machine learning
    Rajabiun, Hananeh
    MohammadHoseini, Mahdis
    Zarezadeh, Hadi
    Delkhosh, Mehdi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 222
  • [30] Mal-Prec: computational prediction of protein Malonylation sites via machine learning based feature integrationMalonylation site prediction
    Xin Liu
    Liang Wang
    Jian Li
    Junfeng Hu
    Xiao Zhang
    BMC Genomics, 21