Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method

被引:7
作者
Gao, Jianzhao [1 ,2 ]
Tao, Xue-Wen [3 ]
Zhao, Jia [4 ]
Feng, Yuan-Ming [3 ]
Cai, Yu-Dong [5 ]
Zhang, Ning [3 ]
机构
[1] Nankai Univ, Sch Math Sci, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Tianjin Univ, Tianjin Key Lab Biomed Engn Measurement, Dept Biomed Engn, Tianjin, Peoples R China
[4] CODBIO Co Ltd, Biomed Res Ctr, Tianjin, Peoples R China
[5] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Acetylation; post-translational modification; dagging; maximum relevance minimum redundancy; incremental feature selection; epsilon lysine acetylation site; NONHISTONE PROTEINS; METHYLATION; DISORDER; SUBSTRATE; SEQUENCES; DATABASE; TARGETS; BINDING; SETS;
D O I
10.2174/1386207320666170314093216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aim and Objective: Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Material and Methods: Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Results: Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites. Conclusion: We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation.
引用
收藏
页码:629 / 637
页数:9
相关论文
共 56 条
[51]   AMINO-TERMINAL ALANINE FUNCTIONS IN A CALCIUM-SPECIFIC PROCESS ESSENTIAL FOR MEMBRANE-BINDING BY PROTHROMBIN FRAGMENT-1 [J].
WELSCH, DJ ;
NELSESTUEN, GL .
BIOCHEMISTRY, 1988, 27 (13) :4939-4945
[52]   Lysine acetylation sites prediction using an ensemble of support vector machine classifiers [J].
Xu, Yan ;
Wang, Xiao-Bo ;
Ding, Jun ;
Wu, Ling-Yun ;
Deng, Nai-Yang .
JOURNAL OF THEORETICAL BIOLOGY, 2010, 264 (01) :130-135
[53]   Lysine acetylation and the bromodomain: a new partnership for signaling [J].
Yang, XJ .
BIOESSAYS, 2004, 26 (10) :1076-1087
[54]   Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis [J].
Zhang, Ning ;
Zhou, You ;
Huang, Tao ;
Zhang, Yu-Chao ;
Li, Bi-Qing ;
Chen, Lei ;
Cai, Yu-Dong .
PLOS ONE, 2014, 9 (09)
[55]   Computational prediction and analysis of protein γ-carboxylation sites based on a random forest method [J].
Zhang, Ning ;
Li, Bi-Qing ;
Gao, Shan ;
Ruan, Ji-Shou ;
Cai, Yu-Dong .
MOLECULAR BIOSYSTEMS, 2012, 8 (11) :2946-2955
[56]   Regulation of Cellular Metabolism by Protein Lysine Acetylation [J].
Zhao, Shimin ;
Xu, Wei ;
Jiang, Wenqing ;
Yu, Wei ;
Lin, Yan ;
Zhang, Tengfei ;
Yao, Jun ;
Zhou, Li ;
Zeng, Yaxue ;
Li, Hong ;
Li, Yixue ;
Shi, Jiong ;
An, Wenlin ;
Hancock, Susan M. ;
He, Fuchu ;
Qin, Lunxiu ;
Chin, Jason ;
Yang, Pengyuan ;
Chen, Xian ;
Lei, Qunying ;
Xiong, Yue ;
Guan, Kun-Liang .
SCIENCE, 2010, 327 (5968) :1000-1004