Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method

Cited by: 7
Authors
Gao, Jianzhao [1, 2]
Tao, Xue-Wen [3 ]
Zhao, Jia [4 ]
Feng, Yuan-Ming [3 ]
Cai, Yu-Dong [5 ]
Zhang, Ning [3 ]
Affiliations
[1] Nankai Univ, Sch Math Sci, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Tianjin Univ, Tianjin Key Lab Biomed Engn Measurement, Dept Biomed Engn, Tianjin, Peoples R China
[4] CODBIO Co Ltd, Biomed Res Ctr, Tianjin, Peoples R China
[5] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China; National Research Foundation, Singapore
Keywords
Acetylation; post-translational modification; dagging; maximum relevance minimum redundancy; incremental feature selection; epsilon lysine acetylation site; NONHISTONE PROTEINS; METHYLATION; DISORDER; SUBSTRATE; SEQUENCES; DATABASE; TARGETS; BINDING; SETS;
DOI
10.2174/1386207320666170314093216
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Aim and Objective: Lysine acetylation, one type of post-translational modification (PTM), plays key roles in cellular regulation and is involved in a variety of human diseases. However, identifying lysine acetylation sites with traditional experimental approaches is often costly and time-consuming, so effective computational methods are needed to predict them. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Material and Methods: Sequences of acetylated proteins were retrieved from the UniProt database. Several kinds of features were incorporated, including the position-specific scoring matrix (PSSM), amino acid factors (AAF), and disorder. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Results: 319 optimal features were selected from a total of 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. It achieved an accuracy of 69.56% with an MCC of 0.2792. Analysis of the optimal features suggested some important factors that determine lysine acetylation sites. Conclusion: We developed a position-specific method for epsilon lysine acetylation site prediction and selected a set of optimal features. Analysis of these features provided insights into the mechanism of lysine acetylation and guidance for experimental validation.
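The abstract names incremental feature selection (IFS) over an mRMR-ranked feature list and reports the Matthews correlation coefficient (MCC). The following is a minimal Python sketch of those two ideas only, not the authors' implementation. It assumes the mRMR ranking has already been computed elsewhere, and the names `mcc`, `incremental_feature_selection`, `ranked`, and `evaluate` are hypothetical. In practice `evaluate` would train and cross-validate a classifier (the paper uses dagging) on the given feature subset and return a score such as MCC.

```python
import math
from typing import Callable, List, Sequence, Tuple


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal total is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def incremental_feature_selection(
    ranked: Sequence[str],
    evaluate: Callable[[Sequence[str]], float],
) -> Tuple[List[str], float]:
    """Score the top-k prefix of a ranked feature list for every k
    and return the best-scoring prefix together with its score.

    `ranked` is assumed to be ordered by an upstream method such as mRMR.
    `evaluate` is a caller-supplied (hypothetical) scoring function.
    """
    best_k, best_score = 0, float("-inf")
    for k in range(1, len(ranked) + 1):
        score = evaluate(ranked[:k])
        if score > best_score:  # keep the smallest k on ties
            best_k, best_score = k, score
    return list(ranked[:best_k]), best_score
```

With 541 ranked features, this loop would evaluate 541 candidate subsets; the subset reported in the abstract corresponds to the prefix of length 319.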
Pages: 629-637
Number of pages: 9