Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction

被引:6
|
作者
Luo, Weiliang [1 ,2 ]
Zhou, Gengmo [2 ,3 ]
Zhu, Zhengdan [2 ]
Yuan, Yannan [2 ]
Ke, Guolin [2 ]
Wei, Zhewei [3 ]
Gao, Zhifeng [2 ]
Zheng, Hang [2 ]
机构
[1] MIT, Dept Chem, Cambridge, MA 02139 USA
[2] DP Technol, Beijing 100089, Peoples R China
[3] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China
来源
JACS AU | 2024年 / 4卷 / 09期
关键词
pK(a); machine learning; protonation ensemble; pretraining-finetuning strategy; free energy modeling; chemical thermodynamics; MELDRUMS ACID; PROGRAM; ORIGIN; VALUES;
D O I
10.1021/jacsau.4c00271
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Integrating scientific principles into machine learning models to enhance their predictive performance and generalizability is a central challenge in the development of AI for Science. Herein, we introduce Uni-pK(a), a novel framework that successfully incorporates thermodynamic principles into machine learning modeling, achieving high-precision predictions of acid dissociation constants (pK(a)), a crucial task in the rational design of drugs and catalysts, as well as a modeling challenge in computational physical chemistry for small organic molecules. Uni-pK(a) utilizes a comprehensive free energy model to represent molecular protonation equilibria accurately. It features a structure enumerator that reconstructs molecular configurations from pK(a) data, coupled with a neural network that functions as a free energy predictor, ensuring high-throughput, data-driven prediction while preserving thermodynamic consistency. Employing a pretraining-finetuning strategy with both predicted and experimental pK(a) data, Uni-pK(a) not only achieves state-of-the-art accuracy in chemoinformatics but also shows comparable precision to quantum mechanics-based methods.
引用
收藏
页码:3451 / 3465
页数:15
相关论文
共 50 条
  • [31] Leveraging ChemBERTa and machine learning for accurate toxicity prediction of ionic liquids
    Sadaghiyanfam, Safa
    Kamberaj, Hiqmet
    Isler, Yalcin
    JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS, 2025, 171
  • [32] Accurate Load Prediction Algorithms Assisted with Machine Learning for Network Traffic
    Gao, Yin
    Zhang, Man
    Chen, Jiajun
    Han, Jiren
    Li, Dapeng
    Qiu, Ruitao
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 1683 - 1688
  • [33] Integrating graphology and machine learning for accurate prediction of personality: a novel approach
    Bandhu, Kailash Chandra
    Litoriya, Ratnesh
    Khatri, Mihir
    Kaul, Milind
    Soni, Prakhar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46457 - 46481
  • [34] A novel ensemble machine learning method for accurate air quality prediction
    Emec, M.
    Yurtsever, M.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2025, 22 (01) : 459 - 476
  • [35] ACCURATE PREDICTION OF SURVIVAL IN GLIOBLASTOMA MULTIFORME. A MACHINE LEARNING TOOL
    Ruiz-Patino, A.
    NEURO-ONCOLOGY, 2017, 19 : 80 - 80
  • [36] The accurate prediction and characterization of cancerlectin by a combined machine learning and GO analysis
    Tang, Furong
    Zhang, Lichao
    Xu, Lei
    Zou, Quan
    Feng, Hailin
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [37] In Pursuit of Interpretable, Fair and Accurate Machine Learning for Criminal Recidivism Prediction
    Caroline Wang
    Bin Han
    Bhrij Patel
    Cynthia Rudin
    Journal of Quantitative Criminology, 2023, 39 : 519 - 581
  • [38] Accurate solubility prediction with error bars for electrolytes:: A machine learning approach
    Schwaighofer, Anton
    Schroeter, Timon
    Mika, Sebastian
    Laub, Julian
    ter Laak, Antonius
    Suelzle, Detlev
    Ganzer, Ursula
    Heinrich, Nikolaus
    Mueller, Klaus-Robert
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) : 407 - 424
  • [39] Applied Machine Learning Algorithms for Courtyards Thermal Patterns Accurate Prediction
    Diz-Mellado, Eduardo
    Rubino, Samuele
    Fernandez-Garcia, Soledad
    Gomez-Marmol, Macarena
    Rivera-Gomez, Carlos
    Galan-Marin, Carmen
    MATHEMATICS, 2021, 9 (10)
  • [40] Machine Learning Deciphered Molecular Mechanistics with Accurate Kinetic and Thermodynamic Prediction
    Dong, Junlin
    Wang, Shiyu
    Cui, Wenqiang
    Sun, Xiaolin
    Guo, Haojie
    Yan, Hailu
    Vogel, Horst
    Wang, Zhi
    Yuan, Shuguang
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 20 (11) : 4499 - 4513