Bridging the gap between high-level quantum chemical methods and deep learning models

被引:1
|
作者
Prasad, Viki Kumar [1 ,4 ,5 ]
Otero-de-la-Roza, Alberto [2 ,3 ]
DiLabio, Gino A. [1 ]
机构
[1] Univ British Columbia, Dept Chem, 3247 Univ Way, Kelowna, BC V1V 1V7, Canada
[2] Univ Oviedo, Fac Quim, Dept Quim Fis & Analit, Oviedo 33006, Spain
[3] Univ Oviedo, Fac Quim, MALTA Consolider Team, Oviedo 33006, Spain
[4] Univ Toronto Mississauga, Dept Chem & Phys Sci, 3359 Mississauga Rd, Mississauga, ON L5L 1C6, Canada
[5] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, 10 Kings Coll Rd, Toronto, ON M5S 3G4, Canada
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 01期
关键词
machine learning; deep learning; chemistry; thermochemistry; training set; data generation; atom-centered potentials; DENSITY-FUNCTIONAL THEORY; MAIN-GROUP THERMOCHEMISTRY; BASIS-SETS; CHEMISTRY; KINETICS;
D O I
10.1088/2632-2153/ad27e1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised deep learning (DL) models are becoming ubiquitous in computational chemistry because they can efficiently learn complex input-output relationships and predict chemical properties at a cost significantly lower than methods based on quantum mechanics. The central challenge in many DL applications is the need to invest considerable computational resources in generating large ( N>1x10(5) ) training sets such that the resulting DL model can be generalized reliably to unseen systems. The lack of better alternatives has encouraged the use of low-cost and relatively inaccurate density-functional theory (DFT) methods to generate training data, leading to DL models that lack accuracy and reliability. In this article, we describe a robust and easily implemented approach based on property-specific atom-centered potentials (ACPs) that resolves this central challenge in DL model development. ACPs are one-electron potentials that are applied in combination with a computationally inexpensive but inaccurate quantum mechanical method (e.g. double-zeta DFT) and fitted against relatively few high-level data ( N approximate to 1x10(3) - 1x10(4 )), possibly obtained from the literature. The resulting ACP-corrected methods retain the low cost of the double-zeta DFT approach, while generating high-level-quality data in unseen systems for the specific property for which they were designed. With this approach, we demonstrate that ACPs can be used as an intermediate method between high-level approaches and DL model development, enabling the calculation of large and accurate DL training sets for the chemical property of interest. We demonstrate the effectiveness of the proposed approach by predicting bond dissociation enthalpies, reaction barrier heights, and reaction energies with chemical accuracy at a computational cost lower than the DFT methods routinely used for DL training data set generation.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Bridging the Gap between High-Level Reasoning and Low-Level Control
    Caldiran, Ozan
    Haspalamutgil, Kadir
    Ok, Abdullah
    Palaz, Can
    Erdem, Esra
    Patoglu, Volkan
    LOGIC PROGRAMMING AND NONMONOTONIC REASONING, PROCEEDINGS, 2009, 5753 : 342 - 354
  • [2] High-level quantum chemical methods for the study of photochemical processes
    Lischka, H
    Aquino, AJA
    Barbatti, M
    Solimannejad, M
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 1, 2005, 3480 : 1004 - 1011
  • [3] A Teaching Approach for Bridging the Gap Between Low-Level and High-Level Programming Using Assembly Language Learning for Small Microcontrollers
    Bolanakis, Dimosthenis E.
    Evangelakis, Georgios A.
    Glavas, Euripidis
    Kotsis, Konstantinos T.
    COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2011, 19 (03) : 525 - 537
  • [4] Bridging the Diagnostic Gap between Histopathologic and Hysteroscopic Chronic Endometritis with Deep Learning Models
    Kitaya, Kotaro
    Yasuo, Tadahiro
    Yamaguchi, Takeshi
    MEDICINA-LITHUANIA, 2024, 60 (06):
  • [5] Bridging the gap between debiasing and privacy for deep learning
    Barbano, Carlo Alberto
    Tartaglione, Enzo
    Grangetto, Marco
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3799 - 3808
  • [6] Bridging the Gap Between High-Level Reasoning in Strategic Agent Coordination and Low-Level Agent Development
    Bondi, Elizabeth
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2402 - 2404
  • [7] High-level methods for quantum computation and information
    Abramsky, S
    19TH ANNUAL IEEE SYMPOSIUM ON LOGIC IN COMPUTER SCIENCE, PROCEEDINGS, 2004, : 410 - 414
  • [8] From low-level features to high-level semantics: Are we bridging the gap?
    Chen, TH
    ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 179 - 179
  • [9] “Transfer Learning” for Bridging the Gap Between Data Sciences and the Deep Learning
    Sohail A.
    Annals of Data Science, 2024, 11 (01) : 337 - 345
  • [10] From High-Level Deep Neural Models to FPGAs
    Sharma, Hardik
    Park, Jongse
    Mahajan, Divya
    Amaro, Emmanuel
    Kim, Joon Kyung
    Shao, Chenkai
    Mishra, Asit
    Esmaeilzadeh, Hadi
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,