Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction

被引:6
|
作者
Luo, Weiliang [1 ,2 ]
Zhou, Gengmo [2 ,3 ]
Zhu, Zhengdan [2 ]
Yuan, Yannan [2 ]
Ke, Guolin [2 ]
Wei, Zhewei [3 ]
Gao, Zhifeng [2 ]
Zheng, Hang [2 ]
机构
[1] MIT, Dept Chem, Cambridge, MA 02139 USA
[2] DP Technol, Beijing 100089, Peoples R China
[3] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China
来源
JACS AU | 2024年 / 4卷 / 09期
关键词
pK(a); machine learning; protonation ensemble; pretraining-finetuning strategy; free energy modeling; chemical thermodynamics; MELDRUMS ACID; PROGRAM; ORIGIN; VALUES;
D O I
10.1021/jacsau.4c00271
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Integrating scientific principles into machine learning models to enhance their predictive performance and generalizability is a central challenge in the development of AI for Science. Herein, we introduce Uni-pK(a), a novel framework that successfully incorporates thermodynamic principles into machine learning modeling, achieving high-precision predictions of acid dissociation constants (pK(a)), a crucial task in the rational design of drugs and catalysts, as well as a modeling challenge in computational physical chemistry for small organic molecules. Uni-pK(a) utilizes a comprehensive free energy model to represent molecular protonation equilibria accurately. It features a structure enumerator that reconstructs molecular configurations from pK(a) data, coupled with a neural network that functions as a free energy predictor, ensuring high-throughput, data-driven prediction while preserving thermodynamic consistency. Employing a pretraining-finetuning strategy with both predicted and experimental pK(a) data, Uni-pK(a) not only achieves state-of-the-art accuracy in chemoinformatics but also shows comparable precision to quantum mechanics-based methods.
引用
收藏
页码:3451 / 3465
页数:15
相关论文
共 50 条
  • [1] Basis for Accurate Protein pKa Prediction with Machine Learning
    Cai, Zhitao
    Liu, Tengzi
    Lin, Qiaoling
    He, Jiahao
    Lei, Xiaowei
    Luo, Fangfang
    Huang, Yandong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (10) : 2936 - 2947
  • [2] DFT-Machine Learning Approach for Accurate Prediction of pKa
    Lawler, Robin
    Liu, Yao-Hao
    Majaya, Nessa
    Allam, Omar
    Ju, Hyunchul
    Kim, Jin Young
    Jang, Seung Soon
    JOURNAL OF PHYSICAL CHEMISTRY A, 2021, 125 (39): : 8712 - 8722
  • [3] Protein pKa Prediction with Machine Learning
    Cai, Zhitao
    Luo, Fangfang
    Wang, Yongxian
    Li, Enling
    Huang, Yandong
    ACS OMEGA, 2021, 6 (50): : 34823 - 34831
  • [4] Epik: pKa and Protonation State Prediction through Machine Learning
    Johnston, Ryne C.
    Yao, Kun
    Kaplan, Zachary
    Chelliah, Monica
    Leswing, Karl
    Seekins, Sean
    Watts, Shawn
    Calkins, David
    Elk, Jackson Chief
    Jerome, Steven, V
    Repasky, Matthew P.
    Shelley, John C.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (08) : 2380 - 2388
  • [5] Protein pKa Prediction by Tree-Based Machine Learning
    Chen, Ada Y.
    Lee, Juyong
    Damjanovic, Ana
    Brooks, Bernard R.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, 18 (04) : 2673 - 2686
  • [6] Machine learning methods for pKa prediction of small molecules: Advances and challenges
    Wu, Jialu
    Kang, Yu
    Pan, Peichen
    Hou, Tingjun
    DRUG DISCOVERY TODAY, 2022, 27 (12)
  • [7] Ensemble Machine Learning Framework for Accurate Flood Prediction
    Varghese, Akanksha
    Gupta, Vijay Baboo
    Saxena, Mayank
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [8] Accurate Prediction of Microstructure of Composites using Machine Learning
    Sang, Sheng
    Xu, Chen
    Fan, Jiadi
    Miao, Daniel
    Side, Conner
    Wang, Ziping
    ADVANCED THEORY AND SIMULATIONS, 2023, 6 (02)
  • [9] Machine Learning in Thermodynamics: Prediction of Activity Coefficients by Matrix Completion
    Jirasek, Fabian
    Alves, Rodrigo A. S.
    Damay, Julie
    Vandermeulen, Robert A.
    Bamler, Robert
    Bortz, Michael
    Mandt, Stephan
    Kloft, Marius
    Hasse, Hans
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2020, 11 (03): : 981 - 985
  • [10] Holistic Prediction of the pKa in Diverse Solvents Based on a Machine-Learning Approach
    Yang, Qi
    Li, Yao
    Yang, Jin-Dong
    Liu, Yidi
    Zhang, Long
    Luo, Sanzhong
    Cheng, Jin-Pei
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2020, 59 (43) : 19282 - 19291