A critical examination of compound stability predictions from machine-learned formation energies

Cited by: 178
Authors
Bartel, Christopher J. [1 ]
Trewartha, Amalie [1 ]
Wang, Qi [2 ]
Dunn, Alexander [1 ,2 ]
Jain, Anubhav [2 ]
Ceder, Gerbrand [1 ,3 ]
Affiliations
[1] Univ Calif Berkeley, Dept Mat Sci & Engn, Berkeley, CA 94720 USA
[2] Lawrence Berkeley Natl Lab, Energy Technol Area, Berkeley, CA 94720 USA
[3] Lawrence Berkeley Natl Lab, Div Mat Sci, Berkeley, CA 94720 USA
DOI
10.1038/s41524-020-00362-y
Chinese Library Classification
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Subject classification codes
070304 ; 081704 ;
Abstract
Machine learning has emerged as a novel tool for the efficient prediction of material properties, and claims have been made that machine-learned models for the formation energy of compounds can approach the accuracy of Density Functional Theory (DFT). The models tested in this work include five recently published compositional models, a baseline model using stoichiometry alone, and a structural model. By testing seven machine learning models for formation energy on stability predictions using the Materials Project database of DFT calculations for 85,014 unique chemical compositions, we show that while formation energies can indeed be predicted well, all compositional models perform poorly on predicting the stability of compounds, making them considerably less useful than DFT for the discovery and design of new solids. Most critically, in sparse chemical spaces where few stoichiometries have stable compounds, only the structural model is capable of efficiently detecting which materials are stable. The nonincremental improvement of structural models compared with compositional models is noteworthy and encourages the use of structural models for materials discovery, with the constraint that for any new composition, the ground-state structure is not known a priori. This work demonstrates that accurate predictions of formation energy do not imply accurate predictions of stability, emphasizing the importance of assessing model performance on stability predictions, for which we provide a set of publicly available tests.
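The gap between accurate formation energies and accurate stability predictions can be illustrated with a small sketch. It assumes that stability is judged by the decomposition energy, i.e. the energy of a compound relative to the convex hull of its competing phases; the abstract does not spell out this construction, so treat it as an assumption here. The binary A-B system, the compound names, all energies (eV/atom), and the function names are hypothetical and chosen only for illustration; they are not data or code from the paper.

```python
# Hypothetical binary A-B system. Keys are phase names, values are
# (atomic fraction of B, formation energy in eV/atom). All numbers invented.
reference = {"A": (0.0, 0.0), "A3B": (0.25, -0.30), "AB": (0.50, -0.50),
             "AB3": (0.75, -0.28), "B": (1.0, 0.0)}
# "Predicted" energies: every compound is off by only 0.04 eV/atom.
predicted = {"A": (0.0, 0.0), "A3B": (0.25, -0.26), "AB": (0.50, -0.54),
             "AB3": (0.75, -0.24), "B": (1.0, 0.0)}
compounds = ["A3B", "AB", "AB3"]


def lower_hull(points):
    """Return the lower convex hull of (x, energy) points, sorted by x."""
    lowest = {}
    for x, e in points:
        lowest[x] = min(e, lowest.get(x, e))  # keep lowest energy per x
    hull = []
    for p in sorted(lowest.items()):
        # Drop the last hull point while it lies on or above the new segment.
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            cross = (x2 - x1) * (p[1] - y1) - (y2 - y1) * (p[0] - x1)
            if cross <= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    return hull


def hull_energy(hull, x):
    """Linearly interpolate the hull energy at composition x."""
    for (x1, y1), (x2, y2) in zip(hull, hull[1:]):
        if x1 <= x <= x2:
            return y1 + (x - x1) / (x2 - x1) * (y2 - y1)
    raise ValueError("composition outside the hull range")


def decomposition_energy(entries, name):
    """Energy of `name` relative to the hull built from all other phases."""
    x, e = entries[name]
    competing = [v for k, v in entries.items() if k != name]
    return e - hull_energy(lower_hull(competing), x)


# Mean absolute error of the predicted formation energies.
mae = sum(abs(predicted[c][1] - reference[c][1]) for c in compounds) / len(compounds)
# A compound is labeled stable when its decomposition energy is <= 0.
stable_ref = {c: decomposition_energy(reference, c) <= 0 for c in compounds}
stable_pred = {c: decomposition_energy(predicted, c) <= 0 for c in compounds}
print(f"MAE = {mae:.3f} eV/atom")
print("stable (reference):", stable_ref)
print("stable (predicted):", stable_pred)
```

In this toy system, stability depends on small energy differences between neighboring phases rather than on the absolute formation energies, so errors that look minor per compound can still change which phases sit on the hull.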
Pages: 11
Related papers
51 in total
[1]   Synthesis of layered LiMnO2 as an electrode for rechargeable lithium batteries [J].
Armstrong, AR ;
Bruce, PG .
NATURE, 1996, 381 (6582) :499-500
[2]   Thermodynamic limit for synthesis of metastable inorganic materials [J].
Aykol, Muratahan ;
Dwaraknath, Shyam S. ;
Sun, Wenhao ;
Persson, Kristin A. .
SCIENCE ADVANCES, 2018, 4 (04)
[3]   The role of decomposition reactions in assessing first-principles predictions of solid stability [J].
Bartel, Christopher J. ;
Weimer, Alan W. ;
Lany, Stephan ;
Musgrave, Charles B. ;
Holder, Aaron M. .
NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
[4]   Physical descriptor for the Gibbs energy of inorganic crystalline solids and temperature-dependent materials chemistry [J].
Bartel, Christopher J. ;
Millican, Samantha L. ;
Deml, Ann M. ;
Rumptz, John R. ;
Tumas, William ;
Weimer, Alan W. ;
Lany, Stephan ;
Stevanovic, Vladan ;
Musgrave, Charles B. ;
Holder, Aaron M. .
NATURE COMMUNICATIONS, 2018, 9
[5]   On representing chemical environments [J].
Bartok, Albert P. ;
Kondor, Risi ;
Csanyi, Gabor .
PHYSICAL REVIEW B, 2013, 87 (18)
[6]   Machine learning for molecular and materials science [J].
Butler, Keith T. ;
Davies, Daniel W. ;
Cartwright, Hugh ;
Isayev, Olexandr ;
Walsh, Aron .
NATURE, 2018, 559 (7715) :547-555
[7]   Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals [J].
Chen, Chi ;
Ye, Weike ;
Zuo, Yunxing ;
Zheng, Chen ;
Ong, Shyue Ping .
CHEMISTRY OF MATERIALS, 2019, 31 (09) :3564-3572
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]  
Chollet F., 2015, KERAS
[10]   High-throughput Identification and Characterization of Two-dimensional Materials using Density functional theory [J].
Choudhary, Kamal ;
Kalish, Irina ;
Beams, Ryan ;
Tavazza, Francesca .
SCIENTIFIC REPORTS, 2017, 7