Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method

被引:12
作者
Zhou, You [1 ,2 ,3 ]
Huang, Tao [2 ,3 ]
Huang, Guohua [1 ]
Zhang, Ning [4 ]
Kong, XiangYin [2 ,3 ]
Cai, Yu-Dong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Chinese Acad Sci, Inst Hlth Sci, Shanghai Inst Biol Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[4] Tianjin Univ, Dept Biomed Engn, Tianjin Key Lab Biomed Engn Measurement, Tianjin, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
N-formylation; N-acetylation; Post-translational modification; Random forest; Incremental feature selection; LINKER HISTONE H1; LYSINE ACETYLATION; POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; SITES; METHYLATION; SEQUENCES; PHOSPHORYLATION; IDENTIFICATION; DATABASE;
D O I
10.1016/j.neucom.2015.10.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Post-translational modifications play important roles in cell activities ranging from gene regulation to cytoplasmic mechanisms. Unfortunately, experimental methods investigating protein post-translational modifications such as high-resolution mass spectrometry are time consuming, labor-intensive and expensive. Therefore, there is a need to develop computational methods to facilitate fast and efficient identification. In this study, we developed a method to predict N-formylated methionines based on the Dagging method. Various features were incorporated, including PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. An optimal feature set was selected containing 28 features using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method. The prediction model constructed based on these features achieved an accuracy of 0.9074 and a MCC value of 0.7478. Analysis of these optimal features was performed, and several important factors and important sites were revealed to play important roles in N-formylation formation. We also compared N-formylation with N-acetylation, another type of important N-terminal modification of methionines. A total of top 34 MaxRel (most relevant) features were selected to discriminate between the two types of modifications, which may be candidates for studying the different mechanisms between N-formylation and N-acetylation. The results from our study further the understanding of these two types of modifications and provide guidance for related validation experiments. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 62
页数:10
相关论文
共 57 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] EUKARYOTIC METHIONYL AMINOPEPTIDASES - 2 CLASSES OF COBALT-DEPENDENT ENZYMES
    ARFIN, SM
    KENDALL, RL
    HALL, L
    WEAVER, LH
    STEWART, AE
    MATTHEWS, BW
    BRADSHAW, RA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (17) : 7714 - 7718
  • [3] Solving the protein sequence metric problem
    Atchley, WR
    Zhao, JP
    Fernandes, AD
    Drüke, T
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (18) : 6395 - 6400
  • [4] DNA strand breaking by the hydroxyl radical is governed by the accessible surface areas of the hydrogen atoms of the DNA backbone
    Balasubramanian, B
    Pogozelski, WK
    Tullius, TD
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (17) : 9738 - 9743
  • [5] Predicting N-terminal acetylation based on feature selection method
    Cai, Yu-Dong
    Lu, Lin
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2008, 372 (04) : 862 - 865
  • [6] Prediction of Ubiquitination Sites by Using the Composition of k-Spaced Amino Acid Pairs
    Chen, Zhen
    Chen, Yong-Zi
    Wang, Xiao-Feng
    Wang, Chuan
    Yan, Ren-Xiang
    Zhang, Ziding
    [J]. PLOS ONE, 2011, 6 (07):
  • [7] Mapping post-translational modifications of the histone variant macroH2A1 using tandem mass spectrometry
    Chu, FX
    Nusinow, DA
    Chalkely, RJ
    Plath, K
    Panning, B
    Burlingame, AL
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2006, 5 (01) : 194 - 203
  • [8] WebLogo: A sequence logo generator
    Crooks, GE
    Hon, G
    Chandonia, JM
    Brenner, SE
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1188 - 1190
  • [9] INCREASED ADP-RIBOSYLATION OF HISTONES IN ORAL-CANCER
    DAS, BR
    [J]. CANCER LETTERS, 1993, 73 (01) : 29 - 34
  • [10] Intrinsic disorder and protein function
    Dunker, AK
    Brown, CJ
    Lawson, JD
    Iakoucheva, LM
    Obradovic, Z
    [J]. BIOCHEMISTRY, 2002, 41 (21) : 6573 - 6582