Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method

Cited by: 12
Authors
Zhou, You [1 ,2 ,3 ]
Huang, Tao [2 ,3 ]
Huang, Guohua [1 ]
Zhang, Ning [4 ]
Kong, XiangYin [2 ,3 ]
Cai, Yu-Dong [1 ]
Affiliations
[1] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[2] Chinese Acad Sci, Inst Hlth Sci, Shanghai Inst Biol Sci, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Med, Shanghai, Peoples R China
[4] Tianjin Univ, Dept Biomed Engn, Tianjin Key Lab Biomed Engn Measurement, Tianjin, Peoples R China
Funding
National Research Foundation, Singapore; National Natural Science Foundation of China;
Keywords
N-formylation; N-acetylation; Post-translational modification; Random forest; Incremental feature selection; LINKER HISTONE H1; LYSINE ACETYLATION; POSTTRANSLATIONAL MODIFICATIONS; INTRINSIC DISORDER; SITES; METHYLATION; SEQUENCES; PHOSPHORYLATION; IDENTIFICATION; DATABASE;
DOI
10.1016/j.neucom.2015.10.148
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Post-translational modifications play important roles in cell activities ranging from gene regulation to cytoplasmic mechanisms. Unfortunately, experimental methods for investigating protein post-translational modifications, such as high-resolution mass spectrometry, are time-consuming, labor-intensive and expensive. Computational methods are therefore needed to facilitate fast and efficient identification. In this study, we developed a method to predict N-formylated methionines based on the Dagging method. Various features were incorporated, including PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. An optimal set of 28 features was selected using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method. The prediction model constructed from these features achieved an accuracy of 0.9074 and an MCC value of 0.7478. Analysis of these optimal features revealed several factors and sites that play important roles in N-formylation. We also compared N-formylation with N-acetylation, another important type of N-terminal modification of methionines. The top 34 MaxRel (most relevant) features were selected to discriminate between the two types of modifications; these may be candidates for studying the different mechanisms of N-formylation and N-acetylation. The results of our study further the understanding of these two types of modifications and provide guidance for related validation experiments. (C) 2016 Elsevier B.V. All rights reserved.
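The abstract names two quantitative pieces, incremental feature selection over a ranked feature list and the Matthews correlation coefficient (MCC), that a short sketch may make more concrete. The Python below is an illustration only, not the authors' implementation: it assumes the ranking (for example from mRMR) is already available, and the `evaluate` callback, the function names, and the tie-breaking rule are all hypothetical choices made for this example.

```python
import math
from typing import Callable, List, Sequence, Tuple


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: report 0.0 when the MCC is undefined
    # (one of the four marginal sums is zero).
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def incremental_feature_selection(
    ranked: Sequence[str],
    evaluate: Callable[[Sequence[str]], float],
) -> Tuple[List[str], float]:
    """Score the top-1, top-2, ..., top-N prefixes of a ranked feature
    list and return the best-scoring prefix with its score.

    `evaluate` is a hypothetical callback that trains and validates a
    classifier on the given features and returns a score such as MCC.
    """
    best_k, best_score = 0, float("-inf")
    for k in range(1, len(ranked) + 1):
        score = evaluate(ranked[:k])
        # Strict comparison keeps the smallest prefix on ties (an assumption).
        if score > best_score:
            best_k, best_score = k, score
    return list(ranked[:best_k]), best_score


if __name__ == "__main__":
    # Toy scores keyed by prefix length, standing in for cross-validation.
    toy_scores = {1: 0.5, 2: 0.8, 3: 0.7, 4: 0.8}
    features = ["f1", "f2", "f3", "f4"]
    chosen, score = incremental_feature_selection(
        features, lambda subset: toy_scores[len(subset)]
    )
    print(chosen, score)
```

In a real pipeline `evaluate` would wrap a cross-validated classifier; here a lookup table stands in so that the selection loop itself stays visible.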
Pages: 53-62
Page count: 10