Prediction of protein amidation sites by feature selection and analysis

被引:0
|
作者
Weiren Cui
Shen Niu
Lulu Zheng
Lele Hu
Tao Huang
Lei Gu
Kaiyan Feng
Ning Zhang
Yudong Cai
Yixue Li
机构
[1] Institute of Systems Biology,CAS
[2] Shanghai University,MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences
[3] Chinese Academy of Sciences,Department of Mathematics, College of Science
[4] Shanghai University,Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences
[5] Chinese Academy of Sciences,Hubei Bioinformatics and Molecular Imaging Key Laboratory
[6] Shanghai Center for Bioinformation Technology,Division of Theoretical Bioinformatics (BO80)
[7] Huazhong University of Science and Technology,Tianjin Key Lab of BME Measurement, Department of Biomedical Engineering
[8] German Cancer Research Center,undefined
[9] Tianjin University,undefined
来源
Molecular Genetics and Genomics | 2013年 / 288卷
关键词
Amidation; Maximum relevance minimum redundancy; Incremental feature selection; Nearest neighbor algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Carboxy-terminal α-amidation is a widespread post-translational modification of proteins found widely in vertebrates and invertebrates. The α-amide group is required for full biological activity, since it may render a peptide more hydrophobic and thus better be able to bind to other proteins, preventing ionization of the C-terminus. However, in particular, the C-terminal amidation is very difficult to detect because experimental methods are often labor-intensive, time-consuming and expensive. Therefore, in silico methods may complement due to their high efficiency. In this study, a computational method was developed to predict protein amidation sites, by incorporating the maximum relevance minimum redundancy method and the incremental feature selection method based on the nearest neighbor algorithm. From a total of 735 features, 41 optimal features were selected and were utilized to construct the final predictor. As a result, the predictor achieved an overall Matthews correlation coefficient of 0.8308. Feature analysis showed that PSSM conservation scores and amino acid factors played the most important roles in the α-amidation site prediction. Site-specific feature analyses showed that features derived from the amidation site itself and adjacent sites were most significant. This method presented could be used as an efficient tool to theoretically predict amidated peptides. And the selected features from our study could shed some light on the in-depth understanding of the mechanisms of the amidation modification, providing guidelines for experimental validation.
引用
收藏
页码:391 / 400
页数:9
相关论文
共 50 条
  • [31] Integrative approaches to the prediction of protein functions based on the feature selection
    Seokha Ko
    Hyunju Lee
    BMC Bioinformatics, 10
  • [32] Comparative Study on Feature Selection in Protein Structure and Function Prediction
    Yi, Wenjing
    Sun, Ao
    Liu, Manman
    Liu, Xiaoqing
    Zhang, Wei
    Dai, Qi
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [33] Prediction of Protein Structural Classes Based on Feature Selection Technique
    Ding, Hui
    Lin, Hao
    Chen, Wei
    Li, Zi-Qiang
    Guo, Feng-Biao
    Huang, Jian
    Rao, Nini
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2014, 6 (03) : 235 - 240
  • [34] Feature selection for data driven prediction of protein model quality
    Montuori, Alfonso
    Pugliese, Luisa
    Raimondo, Giovanni
    Pasero, Eros
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 3561 - +
  • [35] LncRNA-protein interaction prediction with reweighted feature selection
    Lv, Guohao
    Xia, Yingchun
    Qi, Zhao
    Zhao, Zihao
    Tang, Lianggui
    Chen, Cheng
    Yang, Shuai
    Wang, Qingyong
    Gu, Lichuan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [36] Prediction of lysine glutarylation sites by maximum relevance minimum redundancy feature selection
    Ju, Zhe
    He, Jian-Jun
    ANALYTICAL BIOCHEMISTRY, 2018, 550 : 1 - 7
  • [37] Prediction of Protein-protein Interactions Based on Feature Selection and Data Balancing
    Liu, Liang
    Lu, Wen-Cong
    Cai, Yu-Dong
    Feng, Kai-Yan
    Peng, Chunrong
    Zhu, Yubei
    PROTEIN AND PEPTIDE LETTERS, 2013, 20 (03): : 336 - 345
  • [38] Exploratory Analysis and Feature Selection for the Prediction of Nitrogen Dioxide
    Iskandaryan, Ditsuhi
    Di Sabatino, Silvana
    Ramos, Francisco
    Trilles, Sergio
    25TH AGILE CONFERENCE ON GEOGRAPHIC INFORMATION SCIENCE ARTIFICIAL INTELLIGENCE IN THE SERVICE OF GEOSPATIAL TECHNOLOGIES, 2022, 3
  • [39] Prediction of Tyrosine Sulfation with mRMR Feature Selection and Analysis
    Niu, Shen
    Huang, Tao
    Feng, Kaiyan
    Cai, Yudong
    Li, Yixue
    JOURNAL OF PROTEOME RESEARCH, 2010, 9 (12) : 6490 - 6497
  • [40] Efficient Feature Selection for Static Analysis Vulnerability Prediction
    Filus, Katarzyna
    Boryszko, Pawel
    Domanska, Joanna
    Siavvas, Miltiadis
    Gelenbe, Erol
    SENSORS, 2021, 21 (04) : 1 - 25