Bridging Machine Learning and Thermodynamics for Accurate pKa Prediction

被引:6
|
作者
Luo, Weiliang [1 ,2 ]
Zhou, Gengmo [2 ,3 ]
Zhu, Zhengdan [2 ]
Yuan, Yannan [2 ]
Ke, Guolin [2 ]
Wei, Zhewei [3 ]
Gao, Zhifeng [2 ]
Zheng, Hang [2 ]
机构
[1] MIT, Dept Chem, Cambridge, MA 02139 USA
[2] DP Technol, Beijing 100089, Peoples R China
[3] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing 100872, Peoples R China
来源
JACS AU | 2024年 / 4卷 / 09期
关键词
pK(a); machine learning; protonation ensemble; pretraining-finetuning strategy; free energy modeling; chemical thermodynamics; MELDRUMS ACID; PROGRAM; ORIGIN; VALUES;
D O I
10.1021/jacsau.4c00271
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Integrating scientific principles into machine learning models to enhance their predictive performance and generalizability is a central challenge in the development of AI for Science. Herein, we introduce Uni-pK(a), a novel framework that successfully incorporates thermodynamic principles into machine learning modeling, achieving high-precision predictions of acid dissociation constants (pK(a)), a crucial task in the rational design of drugs and catalysts, as well as a modeling challenge in computational physical chemistry for small organic molecules. Uni-pK(a) utilizes a comprehensive free energy model to represent molecular protonation equilibria accurately. It features a structure enumerator that reconstructs molecular configurations from pK(a) data, coupled with a neural network that functions as a free energy predictor, ensuring high-throughput, data-driven prediction while preserving thermodynamic consistency. Employing a pretraining-finetuning strategy with both predicted and experimental pK(a) data, Uni-pK(a) not only achieves state-of-the-art accuracy in chemoinformatics but also shows comparable precision to quantum mechanics-based methods.
引用
收藏
页码:3451 / 3465
页数:15
相关论文
共 50 条
  • [21] Accurate prediction of Snare Protein Sequence using Machine Learning
    Talpur, Dani Bux
    Shaikh, Salahuddin
    Khowaja, Ashfaque
    Adnan, Saifullah
    Ghulam, Ali
    BIOSCIENCE RESEARCH, 2022, 19 (03): : 1414 - 1422
  • [22] Accurate prediction of myopic progression and high myopia by machine learning
    Li, Jiahui
    Zeng, Simiao
    Li, Zhihuan
    Xu, Jie
    Sun, Zhuo
    Zhao, Jing
    Li, Meiyan
    Zou, Zixing
    Guan, Taihua
    Zeng, Jin
    Liu, Zhuang
    Xiao, Wenchao
    Wei, Ran
    Miao, Hanpei
    Ziyar, Ian
    Huang, Junxiong
    Gao, Yuanxu
    Zeng, Yangfa
    Zhou, Xing-Tao
    Zhang, Kang
    PRECISION CLINICAL MEDICINE, 2024, 7 (01)
  • [23] Accurate and fast machine learning algorithm for systems outage prediction
    Gu, Chan
    Chen, Chen
    Tang, Wei
    SOLAR ENERGY, 2023, 251 (286-294) : 286 - 294
  • [24] Bridging Chemical Knowledge and Machine Learning for Performance Prediction of Organic Synthesis
    Zhang, Shuo-Qing
    Xu, Li-Cheng
    Li, Shu-Wen
    Oliveira, Joao C. A.
    Li, Xin
    Ackermann, Lutz
    Hong, Xin
    CHEMISTRY-A EUROPEAN JOURNAL, 2023, 29 (06)
  • [25] Bridging a translational gap: using machine learning to improve the prediction of PTSD
    Karen-Inge Karstoft
    Isaac R Galatzer-Levy
    Alexander Statnikov
    Zhiguo Li
    Arieh Y Shalev
    BMC Psychiatry, 15
  • [26] Bridging a translational gap: using machine learning to improve the prediction of PTSD
    Karstoft, Karen-Inge
    Galatzer-Levy, Isaac R.
    Statnikov, Alexander
    Li, Zhiguo
    Shalev, Arieh Y.
    BMC PSYCHIATRY, 2015, 15
  • [27] Accurate pKa and proton state prediction using Epik
    Shelley, John C.
    Greenwood, Jeremy R.
    Timlin, Matt
    Uchiyama, Makoto
    Shelley, Mee
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232 : 287 - 287
  • [28] Protein pKa predictions with machine learning
    Shen, Mingzhe
    Liu, Ruibin
    Shen, Jana
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 549A - 549A
  • [29] SPECIAL TOPIC-Machine learning in biomolecular simulations Progress in protein pKa prediction
    Luo, Fang-Fang
    Cai, Zhi-Tao
    Huang, Yan-Dong
    ACTA PHYSICA SINICA, 2023, 72 (24)
  • [30] Prediction of protein pKa with representation learning
    Gokcan, Hatice
    Isayev, Olexandr
    CHEMICAL SCIENCE, 2022, 13 (08) : 2462 - 2474