An Intelligent System for Identifying Acetylated Lysine on Histones and Nonhistone Proteins

被引:20
作者
Lu, Cheng-Tsung [1 ]
Lee, Tzong-Yi [1 ]
Chen, Yu-Ju [2 ]
Chen, Yi-Ju [2 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan 320, Taiwan
[2] Acad Sinica, Inst Chem, Taipei 115, Taiwan
关键词
MAXIMAL DEPENDENCE DECOMPOSITION; PHOSPHORYLATION SITES; PREDICTION; SIRT1; IDENTIFICATION; SUBSTRATE; SEQUENCES; PROTECTS; DISEASE; MODELS;
D O I
10.1155/2014/528650
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Lysine acetylation is an important and ubiquitous posttranslational modification conserved in prokaryotes and eukaryotes. This process, which is dynamically and temporally regulated by histone acetyltransferases and deacetylases, is crucial for numerous essential biological processes such as transcriptional regulation, cellular signaling, and stress response. Since the experimental identification of lysine acetylation sites within proteins is time-consuming and laboratory-intensive, several computational approaches have been developed to identify candidates for experimental validation. In this work, acetylated protein data collected from UniProtKB were categorized into histone or nonhistone proteins. Support vector machines (SVMs) were applied to build predictive models by using amino acid pair composition (AAPC) as a feature in a histone model. We combined BLOSUM62 and AAPC features in a nonhistone model. Furthermore, using maximal dependence decomposition (MDD) clustering can enhance the performance of the model on a fivefold cross-validation evaluation to yield a sensitivity of 0.863, specificity of 0.885, accuracy of 0.880, and MCC of 0.706. Additionally, the proposed method is evaluated using independent test sets resulting in a predictive accuracy of 74%. This indicates that the performance of our method is comparable with that of other acetylation prediction methods.
引用
收藏
页数:11
相关论文
共 41 条
[1]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[2]   Identifying Protein Phosphorylation Sites with Kinase Substrate Specificity on Human Viruses [J].
Bretana, Neil Arvin ;
Lu, Cheng-Tsung ;
Chiang, Chiu-Yun ;
Su, Min-Gang ;
Huang, Kai-Yao ;
Lee, Tzong-Yi ;
Weng, Shun-Long .
PLOS ONE, 2012, 7 (07)
[3]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[4]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[5]   VirusMINT: a viral protein interaction database [J].
Chatr-aryamontri, Andrew ;
Ceol, Arnaud ;
Peluso, Daniele ;
Nardozza, Aurelio ;
Panni, Simona ;
Sacco, Francesca ;
Tinti, Michele ;
Smolyar, Alex ;
Castagnoli, Luisa ;
Vidal, Marc ;
Cusick, Michael E. ;
Cesareni, Gianni .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D669-D673
[6]   Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions [J].
Choudhary, Chunaram ;
Kumar, Chanchal ;
Gnad, Florian ;
Nielsen, Michael L. ;
Rehman, Michael ;
Walther, Tobias C. ;
Olsen, Jesper V. ;
Mann, Matthias .
SCIENCE, 2009, 325 (5942) :834-840
[7]   Calorie restriction promotes mammalian cell survival by inducing the SIRT1 deacetylase [J].
Cohen, HY ;
Miller, C ;
Bitterman, KJ ;
Wall, NR ;
Hekking, B ;
Kessler, B ;
Howitz, KT ;
Gorospe, M ;
de Cabo, R ;
Sinclair, DA .
SCIENCE, 2004, 305 (5682) :390-392
[8]   WebLogo: A sequence logo generator [J].
Crooks, GE ;
Hon, G ;
Chandonia, JM ;
Brenner, SE .
GENOME RESEARCH, 2004, 14 (06) :1188-1190
[9]   Cancer epigenomics: DNA methylomes and histone-modification maps [J].
Esteller, Manel .
NATURE REVIEWS GENETICS, 2007, 8 (04) :286-298
[10]   Loss of acetylation at Lys16 and trimethylation at Lys20 of histone H4 is a common hallmark of human cancer [J].
Fraga, MF ;
Ballestar, E ;
Villar-Garea, A ;
Boix-Chornet, M ;
Espada, J ;
Schotta, G ;
Bonaldi, T ;
Haydon, C ;
Ropero, S ;
Petrie, K ;
Iyer, NG ;
Pérez-Rosado, A ;
Calvo, E ;
Lopez, JA ;
Cano, A ;
Calasanz, MJ ;
Colomer, D ;
Piris, MA ;
Ahn, N ;
Imhof, A ;
Caldas, C ;
Jenuwein, T ;
Esteller, M .
NATURE GENETICS, 2005, 37 (04) :391-400