Molecular-orbital-based machine learning for open-shell and multi-reference systems with kernel addition Gaussian process regression

被引:9
作者
Cheng, Lixue [1 ]
Sun, Jiace [1 ]
Deustua, J. Emiliano [1 ]
Bhethanabotla, Vignesh C. [1 ]
Miller, Thomas F., III [1 ]
机构
[1] CALTECH, Div Chem & Chem Engn, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
DENSITY-FUNCTIONAL THEORY; AB-INITIO; ENERGIES; T-1;
D O I
10.1063/5.0110886
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
We introduce a novel machine learning strategy, kernel addition Gaussian process regression (KA-GPR), in molecular-orbital-based machine learning (MOB-ML) to learn the total correlation energies of general electronic structure theories for closed- and open-shell systems by introducing a machine learning strategy. The learning efficiency of MOB-ML(KA-GPR) is the same as the original MOB-ML method for the smallest criegee molecule, which is a closed-shell molecule with multi-reference characters. In addition, the prediction accuracies of different small free radicals could reach the chemical accuracy of 1 kcal/mol by training on one example structure. Accurate potential energy surfaces for the H-10 chain (closed-shell) and water OH bond dissociation (open-shell) could also be generated by MOB-ML(KA-GPR). To explore the breadth of chemical systems that KA-GPR can describe, we further apply MOB-ML to accurately predict the large benchmark datasets for closed- (QM9, QM7b-T, and GDB-13-T) and open-shell (QMSpin) molecules. Published under an exclusive license by AIP Publishing.
引用
收藏
页数:10
相关论文
共 66 条
[1]   Hybrid density functional theory predictions of low-temperature dimethyl ether combustion pathways. II. Chain-branching energetics and possible role of the Criegee intermediate [J].
Andersen, A ;
Carter, EA .
JOURNAL OF PHYSICAL CHEMISTRY A, 2003, 107 (44) :9463-9478
[2]   DENSITY-FUNCTIONAL THERMOCHEMISTRY .3. THE ROLE OF EXACT EXCHANGE [J].
BECKE, AD .
JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (07) :5648-5652
[3]   Random Sampling High Dimensional Model Representation Gaussian Process Regression (RS-HDMR-GPR) for Multivariate Function Representation: Application to Molecular Potential Energy Surfaces [J].
Boussaidi, Mohamed Ali ;
Ren, Owen ;
Voytsekhovsky, Dmitry ;
Manzhos, Sergei .
JOURNAL OF PHYSICAL CHEMISTRY A, 2020, 124 (37) :7598-7607
[4]  
Cheng L., 2022, CALTECHDATA DATASET, DOI [10.22002/D1.20200, DOI 10.22002/D1.20200]
[5]  
Cheng L, 2022, DIPOLE MOMENTS MOL E
[6]  
Cheng L, 2019, THERMALIZED 350K QM7, DOI [10.22002/D1.1177, DOI 10.22002/D1.1177]
[7]   Accurate Molecular-Orbital-Based Machine Learning Energies via Unsupervised Clustering of Chemical Space [J].
Cheng, Lixue ;
Sun, Jiace ;
Miller, Thomas F., III .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, 18 (08) :4826-4835
[8]   Regression Clustering for Improved Accuracy and Training Costs with Molecular-Orbital-Based Machine Learning [J].
Cheng, Lixue ;
Kovachki, Nikola B. ;
Welborn, Matthew ;
Miller, Thomas F., III .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (12) :6668-6677
[9]   A universal density matrix functional from molecular orbital-based machine learning: Transferability across organic molecules [J].
Cheng, Lixue ;
Welborn, Matthew ;
Christensen, Anders S. ;
Miller, Thomas F., III .
JOURNAL OF CHEMICAL PHYSICS, 2019, 150 (13)
[10]   Putting Density Functional Theory to the Test in Machine-Learning-Accelerated Materials Discovery [J].
Duan, Chenru ;
Liu, Fang ;
Nandy, Aditya ;
Kulik, Heather J. .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2021, 12 (19) :4628-4637