Two-Level Protein Methylation Prediction using structure model-based features

被引:8
作者
Zheng, Wei [1 ,3 ,4 ]
Wuyun, Qiqige [2 ,3 ,4 ]
Cheng, Micah [5 ]
Hu, Gang [3 ,4 ]
Zhang, Yanping [6 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48823 USA
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
[4] Nankai Univ, LPMC, Tianjin 300071, Peoples R China
[5] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[6] Hebei Univ Engn, Sch Math & Phys, Dept Math, Handan 056038, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID; LYSINE METHYLATION; SITES; IDENTIFICATION; SERVER; HETEROCHROMATIN; DATABASE;
D O I
10.1038/s41598-020-62883-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein methylation plays a vital role in cell processing. Many novel methods try to predict methylation sites from protein sequence by sequence information or predicted structural information, but none of them use protein tertiary structure information in prediction. In particular, most of them do not build models for predicting methylation types (mono-, di-, tri-methylation). To address these problems, we propose a novel method, Met-predictor, to predict methylation sites and methylation types using a support vector machine-based network. Met-predictor combines a variety of sequence-based features that are derived from protein sequences with structure model-based features, which are geometric information extracted from predicted protein tertiary structure models, and are firstly used in methylation prediction. Met-predictor was tested on two independent test sets, where the addition of structure model-based features improved AUC from 0.611 and 0.520 to 0.655 and 0.566 for lysine and from 0.723 and 0.640 to 0.734 and 0.643 for arginine. When compared with other state-of-the-art methods, Met-predictor had 13.1% (3.9%) and 8.5% (16.4%) higher accuracy than the best of other methods for methyllysine and methylarginine prediction on the independent test set I (II). Furthermore, Met-predictor also attains excellent performance for predicting methylation types.
引用
收藏
页数:15
相关论文
共 50 条
[41]   Improvement of recognition speed protein tertiary structure prediction using hidden Markov model [J].
Khedr, Ahmed M. .
KUWAIT JOURNAL OF SCIENCE & ENGINEERING, 2011, 38 (2A) :147-161
[42]   Structure-sequence features based prediction of phosphosites of serine/threonine protein kinases of Mycobacterium tuberculosis [J].
Nilkanth, Vipul V. ;
Mande, Shekhar C. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2022, 90 (01) :131-141
[43]   Structure-based prediction of protein allostery [J].
Greener, Joe G. ;
Sternberg, Michael J. E. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2018, 50 :1-8
[44]   Position-Specific Analysis and Prediction of Protein Pupylation Sites Based on Multiple Features [J].
Zhao, Xiaowei ;
Dai, Jiangyan ;
Ning, Qiao ;
Ma, Zhiqiang ;
Yin, Minghao ;
Sun, Pingping .
BIOMED RESEARCH INTERNATIONAL, 2013, 2013
[45]   Protein Secondary Structure Prediction Based on Improved SVM Method in Compound Pyramid Model [J].
Yang, Bingru ;
Qu, Wu ;
Zhai, Yun ;
Sui, Haifeng .
2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, :4405-4410
[46]   Prediction of Heterodimeric Protein Complexes from Weighted Protein-Protein Interaction Networks Using Novel Features and Kernel Functions [J].
Ruan, Peiying ;
Hayashida, Morihiro ;
Maruyama, Osamu ;
Akutsu, Tatsuya .
PLOS ONE, 2013, 8 (06)
[47]   iLoops: a protein-protein interaction prediction server based on structural features [J].
Planas-Iglesias, Joan ;
Marin-Lopez, Manuel A. ;
Bonet, Jaume ;
Garcia-Garcia, Javier ;
Oliva, Baldo .
BIOINFORMATICS, 2013, 29 (18) :2360-2362
[48]   Prediction and Analysis of Protein Methylarginine and Methyllysine Based on Multisequence Features [J].
Hu, Le-Le ;
Li, Zhen ;
Wang, Kai ;
Niu, Shen ;
Shi, Xiao-He ;
Cai, Yu-Dong ;
Li, Hai-Peng .
BIOPOLYMERS, 2011, 95 (11) :763-771
[49]   A Novel Methylation-Based Model for Prognostic Prediction in Lung Adenocarcinoma [J].
Li, Manyuan ;
Deng, Xufeng ;
Zhou, Dong ;
Liu, Xiaoqing ;
Dai, Jigang ;
Liu, Quanxing .
CURRENT GENOMICS, 2024, 25 (01) :26-40
[50]   GENETIC ALGORITHM BASED ITERATIVE TWO-LEVEL ALGORITHM FOR RESOURCE ALLOCATION PROBLEMS AND APPLICATIONS [J].
Lin, Shin-Yeu ;
Chang, Che-Yen .
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (10B) :7157-7168