Prediction of Thermostability of Enzymes Based on the Amino Acid Index (AAindex) Database and Machine Learning

Cited by: 2
Authors
Li, Gaolin [1 ]
Jia, Lili [2 ]
Wang, Kang [3 ]
Sun, Tingting [3 ]
Huang, Jun [1 ]
Affiliations
[1] Zhejiang Univ Sci & Technol, Sch Biol & Chem Engn, Hangzhou 310023, Peoples R China
[2] China Natl Rice Res Inst, State Key Lab Rice Biol & Breeding, Hangzhou 311400, Peoples R China
[3] Zhejiang Univ Sci & Technol, Dept Phys, Hangzhou 310023, Peoples R China
Source
MOLECULES | 2023 | Volume 28 | Issue 24
Funding
National Natural Science Foundation of China;
Keywords
artificial intelligence; machine learning; thermostability; molecular dynamics simulation; extended sequence; directed evolution; STABILITY CHANGES; PROTEIN; BIOCATALYSIS;
DOI
10.3390/molecules28248097
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The combination of wet-lab experimental data on multi-site combinatorial mutations and machine learning is an innovative method in protein engineering. In this study, we used an innovative sequence-activity relationship (innov'SAR) methodology based on novel descriptors and digital signal processing (DSP) to construct a predictive model. Here, 21 experimental (R)-selective amine transaminases from Aspergillus terreus (AT-ATA) were used as input to predict mutants with higher thermostability than those predicted using the existing data. We improved the coefficient of determination (R²) of the model from 0.66 to 0.92. In addition, root-mean-squared deviation (RMSD), root-mean-squared fluctuation (RMSF), solvent accessible surface area (SASA), hydrogen bonds, and the radius of gyration were estimated from molecular dynamics simulations, and the differences between the predicted mutants and the wild-type (WT) were analyzed. The successful application of the innov'SAR algorithm in improving the thermostability of AT-ATA may help in directed evolutionary screening and open up new avenues for protein engineering.
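The general idea named in the abstract (numerical descriptors from AAindex, a DSP step, a regression model scored by R²) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the choice of the Kyte-Doolittle hydropathy index (AAindex entry KYTJ820101) as the descriptor, the use of an FFT magnitude spectrum as the DSP step, the ridge regression, and the helper names `encode`, `spectrum`, `fit_ridge`, and `r_squared` are all assumptions made for the example. The abstract does not state which index or regression method was used.

```python
import numpy as np

# Kyte-Doolittle hydropathy values (AAindex entry KYTJ820101).
# Assumption: chosen only as an example descriptor; the abstract does not
# say which AAindex entries the study used.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def encode(seq):
    """Map a protein sequence to a numeric signal, one value per residue."""
    return np.array([KD[aa] for aa in seq], dtype=float)


def spectrum(seq):
    """Magnitude spectrum of the encoded sequence (the assumed DSP step)."""
    return np.abs(np.fft.rfft(encode(seq)))


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression with an intercept (a stand-in regressor)."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    w = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ (y - y_mean))
    b = y_mean - x_mean @ w
    return w, b


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Hypothetical toy variants and made-up stability values, for illustration only.
    variants = ["MKTAYIAKQR", "MKTAYLAKQR", "MKTVYIAKQR", "MKTAYIAKER"]
    y = np.array([1.0, 1.4, 1.2, 0.8])
    X = np.vstack([spectrum(s) for s in variants])
    w, b = fit_ridge(X, y, alpha=0.1)
    print("training R^2:", r_squared(y, X @ w + b))
```

With only a few dozen measured variants, as in the study, a training R² from a sketch like this would say little on its own; some form of cross-validation would be needed before comparing values such as the 0.66 and 0.92 reported in the abstract.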
Pages: 14