Computational Prediction of Protein Epsilon Lysine Acetylation Sites Based on a Feature Selection Method

被引:7
|
作者
Gao, Jianzhao [1 ,2 ]
Tao, Xue-Wen [3 ]
Zhao, Jia [4 ]
Feng, Yuan-Ming [3 ]
Cai, Yu-Dong [5 ]
Zhang, Ning [3 ]
机构
[1] Nankai Univ, Sch Math Sci, Tianjin, Peoples R China
[2] Nankai Univ, LPMC, Tianjin, Peoples R China
[3] Tianjin Univ, Tianjin Key Lab Biomed Engn Measurement, Dept Biomed Engn, Tianjin, Peoples R China
[4] CODBIO Co Ltd, Biomed Res Ctr, Tianjin, Peoples R China
[5] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Acetylation; post-translational modification; dagging; maximum relevance minimum redundancy; incremental feature selection; epsilon lysine acetylation site; NONHISTONE PROTEINS; METHYLATION; DISORDER; SUBSTRATE; SEQUENCES; DATABASE; TARGETS; BINDING; SETS;
D O I
10.2174/1386207320666170314093216
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aim and Objective: Lysine acetylation, as one type of post-translational modifications (PTM), plays key roles in cellular regulations and can be involved in a variety of human diseases. However, it is often high-cost and time-consuming to use traditional experimental approaches to identify the lysine acetylation sites. Therefore, effective computational methods should be developed to predict the acetylation sites. In this study, we developed a position-specific method for epsilon lysine acetylation site prediction. Material and Methods: Sequences of acetylated proteins were retrieved from the UniProt database. Various kinds of features such as position specific scoring matrix (PSSM), amino acid factors (AAF), and disorders were incorporated. A feature selection method based on mRMR (Maximum Relevance Minimum Redundancy) and IFS (Incremental Feature Selection) was employed. Results: Finally, 319 optimal features were selected from total 541 features. Using the 319 optimal features to encode peptides, a predictor was constructed based on dagging. As a result, an accuracy of 69.56% with MCC of 0.2792 was achieved. We analyzed the optimal features, which suggested some important factors determining the lysine acetylation sites. Conclusion: We developed a position-specific method for epsilon lysine acetylation site prediction. A set of optimal features was selected. Analysis of the optimal features provided insights into the mechanism of lysine acetylation sites, providing guidance of experimental validation.
引用
收藏
页码:629 / 637
页数:9
相关论文
共 50 条
  • [1] Prediction Method for Lysine Acetylation Sites Based on LSTM Network
    Xiu, Qingxiao
    Li, Dancheng
    Li, Hailong
    Wang, Ning
    Ding, Chen
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 179 - 182
  • [2] ProAcePred: prokaryote lysine acetylation sites prediction based on elastic net feature optimization
    Chen, Guodong
    Cao, Man
    Luo, Kun
    Wang, Lina
    Wen, Pingping
    Shi, Shaoping
    BIOINFORMATICS, 2018, 34 (23) : 3999 - 4006
  • [3] A method to distinguish between lysine acetylation and lysine ubiquitination with feature selection and analysis
    Zhoua, You
    Zhang, Ning
    Li, Bi-Qing
    Huang, Tao
    Cai, Yu-Dong
    Kong, Xiang-Yin
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2015, 33 (11): : 2479 - 2490
  • [4] Prediction of Lysine Acetylation Sites Based on Neural Network
    Bao, Wenzheng
    Jiang, Zhichao
    Han, Kyungsook
    Huang, De-Shuang
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 873 - 879
  • [5] Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method
    Zhou, You
    Huang, Tao
    Huang, Guohua
    Zhang, Ning
    Kong, XiangYin
    Cai, Yu-Dong
    NEUROCOMPUTING, 2016, 217 : 53 - 62
  • [6] Computational Method for Distinguishing Lysine Acetylation, Sumoylation, and Ubiquitination using the Random Forest Algorithm with a Feature Selection Procedure
    Wang, ShaoPeng
    Li, JiaRui
    Yuan, Fei
    Chen, Lei
    Huang, Tao
    Cai, Yu-Dong
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2017, 20 (10) : 886 - 895
  • [7] Prediction of protein amidation sites by feature selection and analysis
    Weiren Cui
    Shen Niu
    Lulu Zheng
    Lele Hu
    Tao Huang
    Lei Gu
    Kaiyan Feng
    Ning Zhang
    Yudong Cai
    Yixue Li
    Molecular Genetics and Genomics, 2013, 288 : 391 - 400
  • [8] Prediction of protein amidation sites by feature selection and analysis
    Cui, Weiren
    Niu, Shen
    Zheng, Lulu
    Hu, Lele
    Huang, Tao
    Gu, Lei
    Feng, Kaiyan
    Zhang, Ning
    Cai, Yudong
    Li, Yixue
    MOLECULAR GENETICS AND GENOMICS, 2013, 288 (09) : 391 - 400
  • [9] Protein sumoylation sites prediction based on two-stage feature selection
    Lin Lu
    Xiao-He Shi
    Su-Jun Li
    Zhi-Qun Xie
    Yong-Li Feng
    Wen-Cong Lu
    Yi-Xue Li
    Haipeng Li
    Yu-Dong Cai
    Molecular Diversity, 2010, 14 : 81 - 86
  • [10] Protein sumoylation sites prediction based on two-stage feature selection
    Lu, Lin
    Shi, Xiao-He
    Li, Su-Jun
    Xie, Zhi-Qun
    Feng, Yong-Li
    Lu, Wen-Cong
    Li, Yi-Xue
    Li, Haipeng
    Cai, Yu-Dong
    MOLECULAR DIVERSITY, 2010, 14 (01) : 81 - 86