An Intelligent System for Identifying Acetylated Lysine on Histones and Nonhistone Proteins

被引:20
作者
Lu, Cheng-Tsung [1 ]
Lee, Tzong-Yi [1 ]
Chen, Yu-Ju [2 ]
Chen, Yi-Ju [2 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan 320, Taiwan
[2] Acad Sinica, Inst Chem, Taipei 115, Taiwan
关键词
MAXIMAL DEPENDENCE DECOMPOSITION; PHOSPHORYLATION SITES; PREDICTION; SIRT1; IDENTIFICATION; SUBSTRATE; SEQUENCES; PROTECTS; DISEASE; MODELS;
D O I
10.1155/2014/528650
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Lysine acetylation is an important and ubiquitous posttranslational modification conserved in prokaryotes and eukaryotes. This process, which is dynamically and temporally regulated by histone acetyltransferases and deacetylases, is crucial for numerous essential biological processes such as transcriptional regulation, cellular signaling, and stress response. Since the experimental identification of lysine acetylation sites within proteins is time-consuming and laboratory-intensive, several computational approaches have been developed to identify candidates for experimental validation. In this work, acetylated protein data collected from UniProtKB were categorized into histone or nonhistone proteins. Support vector machines (SVMs) were applied to build predictive models by using amino acid pair composition (AAPC) as a feature in a histone model. We combined BLOSUM62 and AAPC features in a nonhistone model. Furthermore, using maximal dependence decomposition (MDD) clustering can enhance the performance of the model on a fivefold cross-validation evaluation to yield a sensitivity of 0.863, specificity of 0.885, accuracy of 0.880, and MCC of 0.706. Additionally, the proposed method is evaluated using independent test sets resulting in a predictive accuracy of 74%. This indicates that the performance of our method is comparable with that of other acetylation prediction methods.
引用
收藏
页数:11
相关论文
共 41 条
  • [1] [Anonymous], 2011, Pei. data mining concepts and techniques
  • [2] Identifying Protein Phosphorylation Sites with Kinase Substrate Specificity on Human Viruses
    Bretana, Neil Arvin
    Lu, Cheng-Tsung
    Chiang, Chiu-Yun
    Su, Min-Gang
    Huang, Kai-Yao
    Lee, Tzong-Yi
    Weng, Shun-Long
    [J]. PLOS ONE, 2012, 7 (07):
  • [3] Prediction of complete gene structures in human genomic DNA
    Burge, C
    Karlin, S
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) : 78 - 94
  • [4] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [5] VirusMINT: a viral protein interaction database
    Chatr-aryamontri, Andrew
    Ceol, Arnaud
    Peluso, Daniele
    Nardozza, Aurelio
    Panni, Simona
    Sacco, Francesca
    Tinti, Michele
    Smolyar, Alex
    Castagnoli, Luisa
    Vidal, Marc
    Cusick, Michael E.
    Cesareni, Gianni
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D669 - D673
  • [6] Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions
    Choudhary, Chunaram
    Kumar, Chanchal
    Gnad, Florian
    Nielsen, Michael L.
    Rehman, Michael
    Walther, Tobias C.
    Olsen, Jesper V.
    Mann, Matthias
    [J]. SCIENCE, 2009, 325 (5942) : 834 - 840
  • [7] Calorie restriction promotes mammalian cell survival by inducing the SIRT1 deacetylase
    Cohen, HY
    Miller, C
    Bitterman, KJ
    Wall, NR
    Hekking, B
    Kessler, B
    Howitz, KT
    Gorospe, M
    de Cabo, R
    Sinclair, DA
    [J]. SCIENCE, 2004, 305 (5682) : 390 - 392
  • [8] WebLogo: A sequence logo generator
    Crooks, GE
    Hon, G
    Chandonia, JM
    Brenner, SE
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1188 - 1190
  • [9] Cancer epigenomics: DNA methylomes and histone-modification maps
    Esteller, Manel
    [J]. NATURE REVIEWS GENETICS, 2007, 8 (04) : 286 - 298
  • [10] Loss of acetylation at Lys16 and trimethylation at Lys20 of histone H4 is a common hallmark of human cancer
    Fraga, MF
    Ballestar, E
    Villar-Garea, A
    Boix-Chornet, M
    Espada, J
    Schotta, G
    Bonaldi, T
    Haydon, C
    Ropero, S
    Petrie, K
    Iyer, NG
    Pérez-Rosado, A
    Calvo, E
    Lopez, JA
    Cano, A
    Calasanz, MJ
    Colomer, D
    Piris, MA
    Ahn, N
    Imhof, A
    Caldas, C
    Jenuwein, T
    Esteller, M
    [J]. NATURE GENETICS, 2005, 37 (04) : 391 - 400