TMPpred: A support vector machine-based thermophilic protein identifier

Cited by: 15
Authors
Meng, Chaolu [1, 2]
Ju, Ying [3]
Shi, Hua [4]
Affiliations
[1] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, Hohhot, Peoples R China
[2] Inner Mongolia Autonomous Reg Key Lab Big Data Res, Hohhot, Peoples R China
[3] Xiamen Univ, Sch Informat, Xiamen, Peoples R China
[4] Xiamen Univ Technol, Sch Optoelect & Commun Engn, Xiamen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Thermostability of protein; Machine learning; Support vector machine; Binary classification; Feature selection; Transcription factors; Prediction; Sequence; Information
DOI
10.1016/j.ab.2022.114625
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: Thermostability allows proteins to overcome temperature constraints and perform a wider range of functions. Using machine learning, we explored the mechanisms and causes of protein thermostability. Results: Unlike methods that pursue only model performance, we aim to identify important features that can serve as a reference for in vitro experiments. We cast the task as a binary classification problem: distinguishing thermophilic proteins from nonthermophilic proteins. From the construction and analysis of a support vector machine-based model, we inferred that Gly, Ala, Ser and Thr may be the most important residue-level components determining the thermal stability of proteins. The proposed model achieves a sensitivity (Sn) of 0.892, a specificity (Sp) of 0.857, an accuracy (ACC) of 0.87566 and an AUC of 0.874. To assist other researchers, we packaged the model and deployed it as a web server, accessible at http://112.124.26.17:7000/TMPpred/index.html.
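Illustrative sketch (not part of the original record): the abstract refers to residue-level composition and to Sn, Sp and ACC for a binary thermophilic/nonthermophilic classifier. The minimal Python example below shows how such quantities are commonly computed. It is not the authors' implementation; the function names `amino_acid_composition` and `binary_metrics`, the label convention (1 = thermophilic, 0 = nonthermophilic) and the toy inputs are assumptions made here for illustration.

```python
from collections import Counter

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return the fraction of each standard residue in a protein sequence.

    Non-standard characters are ignored. This is a generic residue-level
    feature, assumed here for illustration only.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard residues")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def binary_metrics(y_true, y_pred):
    """Compute sensitivity (Sn), specificity (Sp) and accuracy (ACC).

    Assumed label convention: 1 = thermophilic, 0 = nonthermophilic.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    sn = tp / (tp + fn) if (tp + fn) else 0.0  # true positive rate
    sp = tn / (tn + fp) if (tn + fp) else 0.0  # true negative rate
    acc = (tp + tn) / len(pairs)
    return {"Sn": sn, "Sp": sp, "ACC": acc}


if __name__ == "__main__":
    # Toy inputs, invented for this example.
    print(amino_acid_composition("GGAS")["G"])
    print(binary_metrics([1, 1, 0, 0], [1, 0, 0, 0]))
```

In a full pipeline, composition vectors of this kind would be passed to a support vector machine (for example via LIBSVM), and the predicted labels compared against the true labels with `binary_metrics`.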
Pages: 7