Prediction of Thermostability of Enzymes Based on the Amino Acid Index (AAindex) Database and Machine Learning

被引:2
|
作者
Li, Gaolin [1 ]
Jia, Lili [2 ]
Wang, Kang [3 ]
Sun, Tingting [3 ]
Huang, Jun [1 ]
机构
[1] Zhejiang Univ Sci & Technol, Sch Biol & Chem Engn, Hangzhou 310023, Peoples R China
[2] China Natl Rice Res Inst, State Key Lab Rice Biol & Breeding, Hangzhou 311400, Peoples R China
[3] Zhejiang Univ Sci & Technol, Dept Phys, Hangzhou 310023, Peoples R China
来源
MOLECULES | 2023年 / 28卷 / 24期
基金
中国国家自然科学基金;
关键词
artificial intelligence; machine learning; thermostability; molecular dynamics simulation; extended sequence; directed evolution; STABILITY CHANGES; PROTEIN; BIOCATALYSIS;
D O I
10.3390/molecules28248097
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The combination of wet-lab experimental data on multi-site combinatorial mutations and machine learning is an innovative method in protein engineering. In this study, we used an innovative sequence-activity relationship (innov'SAR) methodology based on novel descriptors and digital signal processing (DSP) to construct a predictive model. In this paper, 21 experimental (R)-selective amine transaminases from Aspergillus terreus (AT-ATA) were used as an input to predict higher thermostability mutants than those predicted using the existing data. We successfully improved the coefficient of determination (R2) of the model from 0.66 to 0.92. In addition, root-mean-squared deviation (RMSD), root-mean-squared fluctuation (RMSF), solvent accessible surface area (SASA), hydrogen bonds, and the radius of gyration were estimated based on molecular dynamics simulations, and the differences between the predicted mutants and the wild-type (WT) were analyzed. The successful application of the innov'SAR algorithm in improving the thermostability of AT-ATA may help in directed evolutionary screening and open up new avenues for protein engineering.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Amino Acid Encoding Schemes for Machine Learning Methods
    Zamani, Masood
    Kremer, Stefan C.
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 327 - 333
  • [32] A Machine Learning Study on the Thermostability Prediction of (R)-ω-Selective Amine Transaminase from Aspergillus terreus
    Jia, Li-li
    Sun, Ting-Ting
    Wang, Yan
    Shen, Yu
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [33] Salicylic acid solubility prediction in different solvents based on machine learning algorithms
    Hashemi, Seyed Hossein
    Besharati, Zahra
    Hashemi, Seyed Abdolrasoul
    DIGITAL CHEMICAL ENGINEERING, 2024, 11
  • [34] Prediction of protein-peptide-binding amino acid residues regions using machine learning algorithms
    Shafiee, Shima
    Fathi, Abdolhossein
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,
  • [35] Analysis of hot regions prediction in PPI with different amino acid mutation using machine learning algorithm
    Hu, Jing
    Gan, Haomin
    Zhang, Xiaolong
    Chen, Nansheng
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2254 - 2261
  • [36] Machine Learning based Rainfall Prediction
    Grace, R. Kingsy
    Suganya, B.
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 227 - 229
  • [37] Network Intrusion Detection Based on Amino Acid Sequence Structure Using Machine Learning
    Ibaisi, Thaer A. L.
    Kuhn, Stefan
    Kaiiali, Mustafa
    Kazim, Muhammad
    ELECTRONICS, 2023, 12 (20)
  • [38] Analysis and Prediction of Colorectal Cancer Based on Machine Learning Algorithms
    Chen, Yanming
    He, Xiaolin
    Lin, Chuan
    2024 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND INTELLIGENT SYSTEMS ENGINEERING, MLISE 2024, 2024, : 279 - 283
  • [39] Machine Learning-Based Prediction of Stroke in Emergency Departments
    Abedi, Vida
    Misra, Debdipto
    Chaudhary, Durgesh
    Avula, Venkatesh
    Schirmer, Clemens M.
    Li, Jiang
    Zand, Ramin
    THERAPEUTIC ADVANCES IN NEUROLOGICAL DISORDERS, 2024, 17
  • [40] Ischemia and outcome prediction by cardiac CT based machine learning
    Brandt, Verena
    Emrich, Tilman
    Schoepf, U. Joseph
    Dargis, Danielle M.
    Bayer, Richard R.
    De Cecco, Carlo N.
    Tesche, Christian
    INTERNATIONAL JOURNAL OF CARDIOVASCULAR IMAGING, 2020, 36 (12) : 2429 - 2439