Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method

被引:7
|
作者
Gao, Jianzhao [1 ,2 ]
Tao, Xue-Wen [3 ]
Zhao, Jia [4 ]
Feng, Yuan-Ming [3 ]
Cai, Yu-Dong [5 ]
Zhang, Ning [3 ]
机构
[1] Nankai Univ, Sch Math Sci, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Tianjin Univ, Tianjin Key Lab Biomed Engn Measurement, Dept Biomed Engn, Tianjin, Peoples R China
[4] CODBIO Co Ltd, Biomed Res Ctr, Tianjin, Peoples R China
[5] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Acetylation; post-translational modification; dagging; maximum relevance minimum redundancy; incremental feature selection; epsilon lysine acetylation site; NONHISTONE PROTEINS; METHYLATION; DISORDER; SUBSTRATE; SEQUENCES; DATABASE; TARGETS; BINDING; SETS;
D O I
10.2174/1386207320666170314093216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aim and Objective: Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Material and Methods: Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Results: Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites. Conclusion: We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation.
引用
收藏
页码:629 / 637
页数:9
相关论文
共 50 条
  • [31] Feature Selection for the Prediction of Translation Initiation Sites
    Guo-Liang Li* and Tze-Yun Leong Medical Computing Laboratory
    Genomics Proteomics & Bioinformatics, 2005, (02) : 73 - 83
  • [32] MSTL-Kace: Prediction of Prokaryotic Lysine Acetylation Sites Based on Multistage Transfer Learning Strategy
    Wang, Gang-Ao
    Yan, Xiaodi
    Li, Xiang
    Liu, Yinbo
    Xia, Junfeng
    Zhu, Xiaolei
    ACS OMEGA, 2023, 8 (44): : 41930 - 41942
  • [33] A new feature selection method for computational prediction of type III secreted effectors
    Yang, Yang
    Qi, Sihui
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2014, 10 (04) : 440 - 454
  • [34] Recent Trends on the Development of Machine Learning Approaches for the Prediction of Lysine Acetylation Sites
    Basith, Shaherin
    Chang, Hye Jin
    Nithiyanandam, Saraswathy
    Shin, Tae Hwan
    Manavalan, Balachandran
    Lee, Gwang
    CURRENT MEDICINAL CHEMISTRY, 2022, 29 (02) : 235 - 250
  • [35] Lysine acetylation sites prediction using an ensemble of support vector machine classifiers
    Xu, Yan
    Wang, Xiao-Bo
    Ding, Jun
    Wu, Ling-Yun
    Deng, Nai-Yang
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 264 (01) : 130 - 135
  • [36] Characterization and Prediction of Lysine (K)-Acetyl-Transferase Specific Acetylation Sites
    Li, Tingting
    Du, Yipeng
    Wang, Likun
    Huang, Lei
    Li, Wenlin
    Lu, Ming
    Zhang, Xuegong
    Zhu, Wei-Guo
    MOLECULAR & CELLULAR PROTEOMICS, 2012, 11 (01)
  • [37] A method to distinguish between lysine acetylation and lysine methylation from protein sequences
    Shi, Shao-Ping
    Qiu, Jian-Ding
    Sun, Xing-Yu
    Suo, Sheng-Bao
    Huang, Shu-Yun
    Liang, Ru-Ping
    JOURNAL OF THEORETICAL BIOLOGY, 2012, 310 : 223 - 230
  • [38] A computational method for the analysis and prediction of protein: phosphopeptide-binding sites
    Joughin, BA
    Tidor, B
    Yaffe, MB
    PROTEIN SCIENCE, 2005, 14 (01) : 131 - 139
  • [39] Prediction of protein structural classes based on feature selection technique
    Hui Ding
    Hao Lin
    Wei Chen
    Zi-Qiang Li
    Feng-Biao Guo
    Jian Huang
    Nini Rao
    Interdisciplinary Sciences: Computational Life Sciences, 2014, 6 : 235 - 240
  • [40] Integrative approaches to the prediction of protein functions based on the feature selection
    Ko, Seokha
    Lee, Hyunju
    BMC BIOINFORMATICS, 2009, 10