Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method

被引:12
|
作者
Zhou, You [1 ,2 ,3 ]
Huang, Tao [2 ,3 ]
Huang, Guohua [1 ]
Zhang, Ning [4 ]
Kong, XiangYin [2 ,3 ]
Cai, Yu-Dong [1 ]
机构
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Chinese Acad Sci, Inst Hlth Sci, Shanghai Inst Biol Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[4] Tianjin Univ, Dept Biomed Engn, Tianjin Key Lab Biomed Engn Measurement, Tianjin, Peoples R China
基金
新加坡国家研究基金会; 中国国家自然科学基金;
关键词
N-formylation; N-acetylation; Post-translational modification; Random forest; Incremental feature selection; LINKER HISTONE H1; LYSINE ACETYLATION; POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; SITES; METHYLATION; SEQUENCES; PHOSPHORYLATION; IDENTIFICATION; DATABASE;
D O I
10.1016/j.neucom.2015.10.148
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Post-translational modifications play important roles in cell activities ranging from gene regulation to cytoplasmic mechanisms. Unfortunately, experimental methods investigating protein post-translational modifications such as high-resolution mass spectrometry are time consuming, labor-intensive and expensive. Therefore, there is a need to develop computational methods to facilitate fast and efficient identification. In this study, we developed a method to predict N-formylated methionines based on the Dagging method. Various features were incorporated, including PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. An optimal feature set was selected containing 28 features using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method. The prediction model constructed based on these features achieved an accuracy of 0.9074 and a MCC value of 0.7478. Analysis of these optimal features was performed, and several important factors and important sites were revealed to play important roles in N-formylation formation. We also compared N-formylation with N-acetylation, another type of important N-terminal modification of methionines. A total of top 34 MaxRel (most relevant) features were selected to discriminate between the two types of modifications, which may be candidates for studying the different mechanisms between N-formylation and N-acetylation. The results from our study further the understanding of these two types of modifications and provide guidance for related validation experiments. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:53 / 62
页数:10
相关论文
共 50 条
  • [31] Prediction of Nε-acetylation on internal lysines implemented in Bayesian Discriminant Method
    Li, Ao
    Xue, Yu
    Jin, Changjiang
    Wang, Minghui
    Yao, Xuebiao
    BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2006, 350 (04) : 818 - 824
  • [32] SPECIES-DIFFERENCES IN THE N-ACETYLATION BY LIVER CYTOSOL OF MUTAGENIC HETEROCYCLIC AROMATIC-AMINES IN PROTEIN PYROLYSATES
    SHINOHARA, A
    YAMAZOE, Y
    SAITO, K
    KAMATAKI, T
    KATO, R
    CARCINOGENESIS, 1984, 5 (05) : 683 - 686
  • [33] A RAPID METHOD FOR THE DETERMINATION OF THE DEGREE OF N-ACETYLATION OF CHITIN-CHITOSAN SAMPLES BY ACID-HYDROLYSIS AND HPLC
    NIOLA, F
    BASORA, N
    CHORNET, E
    VIDAL, PF
    CARBOHYDRATE RESEARCH, 1993, 238 : 1 - 9
  • [34] Prediction of polymorphic N-acetylation of new drug candidates by correlation with human NAT1 and NAT2
    Jemnitz, K
    Vereczkey, L
    DRUG DEVELOPMENT RESEARCH, 2002, 56 (01) : 17 - 22
  • [35] A Keggin-based hybrid solid emerged as a promising candidate for CO2-mediated photocatalytic N-formylation of amines
    Sood, Parul
    Bhatt, Sakshi
    Bagdwal, Harshita
    Joshi, Arti
    Singh, Ashi
    Jain, Suman L.
    Singh, Monika
    JOURNAL OF MATERIALS CHEMISTRY A, 2024, 12 (30) : 19168 - 19175
  • [36] A Cu-salen-based conjugated microporous polymer catalyst for N-formylation of CO2 under mild conditions
    Gu, Shuai
    Shou, Junxi
    Chen, Anqi
    Yu, Wenhua
    Tang, Ruiren
    Pan, Chunyue
    Tang, Juntao
    Yu, Guipeng
    CHEMICAL COMMUNICATIONS, 2025, 61 (11) : 2309 - 2312
  • [37] Prediction of protein structural classes based on feature selection technique
    Hui Ding
    Hao Lin
    Wei Chen
    Zi-Qiang Li
    Feng-Biao Guo
    Jian Huang
    Nini Rao
    Interdisciplinary Sciences: Computational Life Sciences, 2014, 6 : 235 - 240
  • [38] Integrative approaches to the prediction of protein functions based on the feature selection
    Ko, Seokha
    Lee, Hyunju
    BMC BIOINFORMATICS, 2009, 10
  • [39] Integrative approaches to the prediction of protein functions based on the feature selection
    Seokha Ko
    Hyunju Lee
    BMC Bioinformatics, 10
  • [40] Prediction of Protein Structural Classes Based on Feature Selection Technique
    Ding, Hui
    Lin, Hao
    Chen, Wei
    Li, Zi-Qiang
    Guo, Feng-Biao
    Huang, Jian
    Rao, Nini
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2014, 6 (03) : 235 - 240