Protein methylation site;
machine learning based method;
feature representation;
feature selection technique;
CITRULLINATION;
PSEKNC;
DOI:
10.1109/TCBB.2017.2670558
Chinese Library Classification number:
Q5 [Biochemistry];
Discipline classification codes:
071010;
081704;
Abstract:
Protein methylation, an important post-translational modification, plays crucial roles in many cellular processes. Accurate prediction of protein methylation sites is fundamentally important for revealing the molecular mechanisms underlying methylation. In recent years, computational prediction based on machine learning algorithms has emerged as a powerful and robust approach for identifying methylation sites, and much progress has been made in improving predictive performance. However, the overall accuracy of existing methods remains unsatisfactory. Motivated by this, we propose a novel random-forest-based predictor called MePred-RF, which integrates several discriminative sequence-based feature descriptors and improves feature representation capability using a powerful feature selection technique. Importantly, unlike other methods that rely on multiple, complex information inputs, MePred-RF is based on sequence information alone. Comparative studies on benchmark datasets via rigorous jackknife tests indicate that MePred-RF remarkably outperforms other state-of-the-art predictors, with overall accuracy higher by 4.5 percent on average. A user-friendly webserver that implements the proposed method has been established for researchers' convenience and is freely available for public use at http://server.malab.cn/MePred-RF. We anticipate that this research tool will be useful for the large-scale prediction and analysis of protein methylation sites.
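To make the evaluation protocol mentioned in the abstract concrete, the following is a minimal sketch of a jackknife (leave-one-out) test on sequence-only features. It does not reproduce MePred-RF: the abstract does not list the feature descriptors or the feature selection technique, so amino acid composition is used as an assumed example descriptor, a nearest-centroid rule stands in for the random forest, and feature selection is omitted. The peptide windows and labels are invented toy data, and all function names are hypothetical.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def composition(window):
    """Amino acid composition: fraction of each of the 20 standard residues.

    Assumes a non-empty window; non-standard residues are ignored.
    """
    counts = Counter(window)
    n = len(window)
    return [counts[a] / n for a in AMINO_ACIDS]

def centroid(vectors):
    """Column-wise mean of a list of equal-length feature vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def sq_dist(u, v):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def jackknife_accuracy(windows, labels):
    """Leave-one-out test: each sample is held out once and predicted
    by a classifier built from all remaining samples.

    The classifier here is a nearest-centroid rule, a simple stand-in
    for the random forest named in the abstract (an assumption made
    to keep the sketch dependency-free).
    """
    feats = [composition(w) for w in windows]
    correct = 0
    for i in range(len(feats)):
        train = [(f, y) for j, (f, y) in enumerate(zip(feats, labels)) if j != i]
        centroids = {
            cls: centroid([f for f, y in train if y == cls])
            for cls in {y for _, y in train}
        }
        pred = min(centroids, key=lambda c: sq_dist(feats[i], centroids[c]))
        correct += int(pred == labels[i])
    return correct / len(feats)

# Invented toy windows centred on arginine (R); 1 = methylated, 0 = not.
toy_windows = ["GGRGG", "GGRGA", "AGRGG", "LLRLL", "LLRLV", "VLRLL"]
toy_labels = [1, 1, 1, 0, 0, 0]

print(jackknife_accuracy(toy_windows, toy_labels))
```

Because every sample is held out exactly once, the jackknife test yields a single deterministic accuracy for a given dataset and deterministic classifier, which is one reason it is often preferred over random subsampling when comparing predictors.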