Uncertainty-Quantified Hybrid Machine Learning/Density Functional Theory High Throughput Screening Method for Crystals

被引:39
作者
Noh, Juhwan [1 ]
Gu, Geun Ho [1 ]
Kim, Sungwon [1 ]
Jung, Yousung [1 ,2 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Dept Chem & Biomol Engn, Daejeon 34141, South Korea
[2] Korea Adv Inst Sci & Technol KAIST, Saudi Aramco KAIST CO2 Management Ctr, Daejeon 34141, South Korea
关键词
NEURAL-NETWORKS; PREDICTION; STABILITY;
D O I
10.1021/acs.jcim.0c00003
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Computational high throughput screening (HTS) has emerged as a significant tool in material science to accelerate the discovery of new materials with target properties in recent years. However, despite many successful cases in which HTS led to the novel discovery, currently, the major bottleneck in HTS is a large computational cost of density functional theory (DFT) calculations that scale cubically with system size, limiting the chemical space that can be explored. The present work aims at addressing this computational burden of HTS by presenting a machine learning (ML) framework that can efficiently explore the chemical space. Our model is built upon an existing crystal graph convolutional neural network (CGCNN) to obtain formation energy of a crystal structure but is modified to allow uncertainty quantification for each prediction using the hyperbolic tangent activation function and dropout algorithm (CGCNN-HD). The uncertainty quantification is particularly important since typical usage of CGCNN (due to the lack of gradient implementation) does not involve structural relaxation which could cause substantial prediction errors. The proposed method is benchmarked against an existing application that identified promising photoanode material among the >7,000 hypothetical Mg-Mn-O ternary compounds using all DFT-HTS. In our approach, we perform the approximate HTS using CGCNN-HD and refine the results using full DFT for those selected (denoted as ML/DFT-HTS). The proposed hybrid model reduces the required DFT calculations by a factor of >50 compared to the previous DFT-HTS in making the same discovery of Mg2MnO4, experimentally validated new photoanode material. Further analysis demonstrates that the addition of HD components with uncertainty measures in the CGCNN-HD model increased the discoverability of promising materials relative to all DFT-HTS from 30% (CGCNN) to 68% (CGCNN-HD). The present ML/DFT-HTS with uncertainty quantification can thus be a fast alternative to DFT-HTS for efficient exploration of the vast chemical space.
引用
收藏
页码:1996 / 2003
页数:8
相关论文
共 47 条
[1]  
[Anonymous], NPJ COMPUT MAT
[2]  
[Anonymous], ARXIV190605267
[3]  
[Anonymous], 2020, ARXIV200110591
[4]  
[Anonymous], 2018, ARXIV180603146
[5]  
[Anonymous], INT C LEARNING REPRE
[6]   Machine learning guided design of functional materials with targeted properties [J].
Balachandran, Prasanna, V .
COMPUTATIONAL MATERIALS SCIENCE, 2019, 164 :82-90
[7]  
Bengio Yoshua, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P437, DOI 10.1007/978-3-642-35289-8_26
[8]   Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals [J].
Chen, Chi ;
Ye, Weike ;
Zuo, Yunxing ;
Zheng, Chen ;
Ong, Shyue Ping .
CHEMISTRY OF MATERIALS, 2019, 31 (09) :3564-3572
[9]  
Choudhary K, 2018, PHYS REV MATER, V2, DOI [10.1103/PhysRevMaterials.2.083801, 10.1103/physrevmaterials.2.083801]
[10]   Predicting the volumes of crystals [J].
Chu, Iek-Heng ;
Roychowdhury, Sayan ;
Han, Daehui ;
Jain, Anubhav ;
Ong, Shyue Ping .
COMPUTATIONAL MATERIALS SCIENCE, 2018, 146 :184-192