Two-Level Protein Methylation Prediction using structure model-based features

被引:8
作者
Zheng, Wei [1 ,3 ,4 ]
Wuyun, Qiqige [2 ,3 ,4 ]
Cheng, Micah [5 ]
Hu, Gang [3 ,4 ]
Zhang, Yanping [6 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48823 USA
[3] Nankai Univ, Sch Math Sci, Tianjin 300071, Peoples R China
[4] Nankai Univ, LPMC, Tianjin 300071, Peoples R China
[5] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[6] Hebei Univ Engn, Sch Math & Phys, Dept Math, Handan 056038, Peoples R China
基金
中国国家自然科学基金;
关键词
AMINO-ACID; LYSINE METHYLATION; SITES; IDENTIFICATION; SERVER; HETEROCHROMATIN; DATABASE;
D O I
10.1038/s41598-020-62883-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein methylation plays a vital role in cell processing. Many novel methods try to predict methylation sites from protein sequence by sequence information or predicted structural information, but none of them use protein tertiary structure information in prediction. In particular, most of them do not build models for predicting methylation types (mono-, di-, tri-methylation). To address these problems, we propose a novel method, Met-predictor, to predict methylation sites and methylation types using a support vector machine-based network. Met-predictor combines a variety of sequence-based features that are derived from protein sequences with structure model-based features, which are geometric information extracted from predicted protein tertiary structure models, and are firstly used in methylation prediction. Met-predictor was tested on two independent test sets, where the addition of structure model-based features improved AUC from 0.611 and 0.520 to 0.655 and 0.566 for lysine and from 0.723 and 0.640 to 0.734 and 0.643 for arginine. When compared with other state-of-the-art methods, Met-predictor had 13.1% (3.9%) and 8.5% (16.4%) higher accuracy than the best of other methods for methyllysine and methylarginine prediction on the independent test set I (II). Furthermore, Met-predictor also attains excellent performance for predicting methylation types.
引用
收藏
页数:15
相关论文
共 50 条
[21]   On the vehicle dynamics prediction via model-based observation [J].
Leanza, Antonio ;
Mantriota, Giacomo ;
Reina, Giulio .
VEHICLE SYSTEM DYNAMICS, 2024, 62 (05) :1181-1202
[22]   aPRBind: protein-RNA interface prediction by combining sequence and I-TASSER model-based structural features learned with convolutional neural networks [J].
Liu, Yang ;
Gong, Weikang ;
Zhao, Yanpeng ;
Deng, Xueqing ;
Zhang, Shan ;
Li, Chunhua .
BIOINFORMATICS, 2021, 37 (07) :937-942
[23]   Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting [J].
Mahmud, S. M. Hasan ;
Chen, Wenyu ;
Meng, Han ;
Jahan, Hosney ;
Liu, Yongsheng ;
Hasan, S. M. Mamun .
ANALYTICAL BIOCHEMISTRY, 2020, 589
[24]   PROTEIN SECONDARY STRUCTURE PREDICTION USING KNOWLEDGE-BASED POTENTIALS [J].
Saraswathi, Saras ;
Jernigan, Robert L. ;
Kloczkowski, Andrzej ;
Kolinski, Andrzej .
ICFC 2010/ ICNC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION AND INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION, 2010, :370-375
[25]   Industrial Process Monitoring Based on Knowledge Data Integrated Sparse Model and Two-Level Deviation Magnitude Plots [J].
Luo, Lijia ;
Bao, Shiyi ;
Mao, Jianfeng ;
Ding, Zhenyu .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2018, 57 (02) :611-622
[26]   Protein structure prediction and model quality assessment [J].
Kryshtafovych, Andriy ;
Fidelis, Krzysztof .
DRUG DISCOVERY TODAY, 2009, 14 (7-8) :386-393
[27]   HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features [J].
Zaman, Rianon ;
Chowdhury, Shahana Yasmin ;
Rashid, Mahmood A. ;
Sharma, Alok ;
Dehzangi, Abdollah ;
Shatabda, Swakkhar .
BIOMED RESEARCH INTERNATIONAL, 2017, 2017
[28]   Prediction of promoters and enhancers using multiple DNA methylation-associated features [J].
Hwang, Woochang ;
Oliver, Verity F. ;
Merbs, Shannath L. ;
Zhu, Heng ;
Qian, Jiang .
BMC GENOMICS, 2015, 16
[29]   Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier [J].
Wei, Leyi ;
Xing, Pengwei ;
Zeng, Jiancang ;
Chen, JinXiu ;
Su, Ran ;
Guo, Fei .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 83 :67-74
[30]   QAUST: Protein Function Prediction Using Structure Similarity, Protein Interaction, and Functional Motifs [J].
Smaili, Fatima Zohra ;
Tian, Shuye ;
Roy, Ambrish ;
Alazmi, Meshari ;
Arold, Stefan T. ;
Mukherjee, Srayanta ;
Hefty, P. Scott ;
Chen, Wei ;
Gao, Xin .
GENOMICS PROTEOMICS & BIOINFORMATICS, 2021, 19 (06) :998-1011