Two-Level Protein Methylation Prediction using structure model-based features

被引:8
作者
Zheng, Wei [1 ,3 ,4 ]
Wuyun, Qiqige [2 ,3 ,4 ]
Cheng, Micah [5 ]
Hu, Gang [3 ,4 ]
Zhang, Yanping [6 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48823 USA
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
[4] Nankai Univ, LPMC, Tianjin 300071, Peoples R China
[5] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[6] Hebei Univ Engn, Sch Math & Phys, Dept Math, Handan 056038, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID; LYSINE METHYLATION; SITES; IDENTIFICATION; SERVER; HETEROCHROMATIN; DATABASE;
D O I
10.1038/s41598-020-62883-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein methylation plays a vital role in cell processing. Many novel methods try to predict methylation sites from protein sequence by sequence information or predicted structural information, but none of them use protein tertiary structure information in prediction. In particular, most of them do not build models for predicting methylation types (mono-, di-, tri-methylation). To address these problems, we propose a novel method, Met-predictor, to predict methylation sites and methylation types using a support vector machine-based network. Met-predictor combines a variety of sequence-based features that are derived from protein sequences with structure model-based features, which are geometric information extracted from predicted protein tertiary structure models, and are firstly used in methylation prediction. Met-predictor was tested on two independent test sets, where the addition of structure model-based features improved AUC from 0.611 and 0.520 to 0.655 and 0.566 for lysine and from 0.723 and 0.640 to 0.734 and 0.643 for arginine. When compared with other state-of-the-art methods, Met-predictor had 13.1% (3.9%) and 8.5% (16.4%) higher accuracy than the best of other methods for methyllysine and methylarginine prediction on the independent test set I (II). Furthermore, Met-predictor also attains excellent performance for predicting methylation types.
引用
收藏
页数:15
相关论文
共 50 条
[31]   Prediction of protease substrates using sequence and structure features [J].
Barkan, David T. ;
Hostetter, Daniel R. ;
Mahrus, Sami ;
Pieper, Ursula ;
Wells, James A. ;
Craik, Charles S. ;
Sali, Andrej .
BIOINFORMATICS, 2010, 26 (14) :1714-1722
[32]   Using WPNNA Classifier in Ubiquitination Site Prediction Based on Hybrid Features [J].
Feng, Kai-Yan ;
Huang, Tao ;
Feng, Kai-Rui ;
Liu, Xiao-Jun .
PROTEIN AND PEPTIDE LETTERS, 2013, 20 (03) :318-323
[33]   Sound Source Localization Indoors Based on Two-Level Reference Points Matching [J].
Wang, Shuopeng ;
Yang, Peng ;
Sun, Hao .
APPLIED SCIENCES-BASEL, 2022, 12 (19)
[34]   Structure-Based Prediction of Protein Phosphorylation Sites Using an Ensemble Approach [J].
Gao, Yong ;
Hao, Weilin ;
Chen, Zhigang ;
Deng, Lei .
INTELLIGENT COMPUTING IN BIOINFORMATICS, 2014, 8590 :119-125
[35]   Structure-based protein function prediction using graph convolutional networks [J].
Gligorijevic, Vladimir ;
Renfrew, P. Douglas ;
Kosciolek, Tomasz ;
Leman, Julia Koehler ;
Berenberg, Daniel ;
Vatanen, Tommi ;
Chandler, Chris ;
Taylor, Bryn C. ;
Fisk, Ian M. ;
Vlamakis, Hera ;
Xavier, Ramnik J. ;
Knight, Rob ;
Cho, Kyunghyun ;
Bonneau, Richard .
NATURE COMMUNICATIONS, 2021, 12 (01)
[36]   Effect of using suboptimal alignments in template-based protein structure prediction [J].
Chen, Hao ;
Kihara, Daisuke .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (01) :315-334
[37]   iLM-2L: A two-level predictor for identifying protein lysine methylation sites and their methylation degrees by incorporating K-gap amino acid pairs into Chou's general PseAAC [J].
Ju, Zhe ;
Cao, Jun-Zhe ;
Gu, Hong .
JOURNAL OF THEORETICAL BIOLOGY, 2015, 385 :50-57
[38]   Protein location prediction using atomic composition and global features of the amino acid sequence [J].
Cherian, Betsy Sheena ;
Nair, Achuthsankar S. .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2010, 391 (04) :1670-1674
[39]   Control of Three-Phase Two-Level Inverters: A Stochastic LPV Model Approach [J].
Luo, Wensheng ;
Zhang, Ruifang ;
Zhang, Jianwen ;
Wu, Ligang ;
Vazquez, Sergio ;
Franquelo, Leopoldo G. .
ENERGIES, 2024, 17 (23)
[40]   Protein Complexes Prediction Method Based on CoreAttachment Structure and Functional Annotations [J].
Li, Bo ;
Liao, Bo .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (09)