Benchmarking graph neural networks for materials chemistry

Cited by: 220
Authors
Fung, Victor [1]
Zhang, Jiaxin [2]
Juarez, Eric [1]
Sumpter, Bobby G. [1]
Affiliations
[1] Oak Ridge Natl Lab, Ctr Nanophase Mat Sci, Oak Ridge, TN 37830 USA
[2] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN USA
Keywords
MACHINE; REPOSITORY; MOLECULES;
DOI
10.1038/s41524-021-00554-0
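Chinese Library Classification (CLC) number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification codes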
070304 ; 081704 ;
Abstract
Graph neural networks (GNNs) have received intense interest as a rapidly expanding class of machine learning models remarkably well-suited for materials applications. To date, a number of successful GNNs have been proposed and demonstrated for systems ranging from crystal stability to electronic property prediction and to surface chemistry and heterogeneous catalysis. However, a consistent benchmark of these models remains lacking, hindering the development and consistent evaluation of new models in the materials field. Here, we present a workflow and testing platform, MatDeepLearn, for quickly and reproducibly assessing and comparing GNNs and other machine learning models. We use this platform to optimize and evaluate a selection of top performing GNNs on several representative datasets in computational materials chemistry. From our investigations we note the importance of hyperparameter selection and find roughly similar performances for the top models once optimized. We identify several strengths in GNNs over conventional models in cases with compositionally diverse datasets and in its overall flexibility with respect to inputs, due to learned rather than defined representations. Meanwhile several weaknesses of GNNs are also observed including high data requirements, and suggestions for further improvement for applications in materials chemistry are discussed.
Pages: 8
Related papers
53 items in total
[11]   Open Catalyst 2020 (OC20) Dataset and Community Challenges [J].
Chanussot, Lowik ;
Das, Abhishek ;
Goyal, Siddharth ;
Lavril, Thibaut ;
Shuaibi, Muhammed ;
Riviere, Morgane ;
Tran, Kevin ;
Heras-Domingo, Javier ;
Ho, Caleb ;
Hu, Weihua ;
Palizhati, Aini ;
Sriram, Anuroop ;
Wood, Brandon ;
Yoon, Junwoong ;
Parikh, Devi ;
Zitnick, C. Lawrence ;
Ulissi, Zachary .
ACS CATALYSIS, 2021, 11 (10) :6059-6072
[12]   Learning properties of ordered and disordered materials from multi-fidelity data [J].
Chen, Chi ;
Zuo, Yunxing ;
Ye, Weike ;
Li, Xiangguo ;
Ong, Shyue Ping .
NATURE COMPUTATIONAL SCIENCE, 2021, 1 (01) :46-+
[13]   A Critical Review of Machine Learning of Energy Materials [J].
Chen, Chi ;
Zuo, Yunxing ;
Ye, Weike ;
Li, Xiangguo ;
Deng, Zhi ;
Ong, Shyue Ping .
ADVANCED ENERGY MATERIALS, 2020, 10 (08)
[14]   Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals [J].
Chen, Chi ;
Ye, Weike ;
Zuo, Yunxing ;
Zheng, Chen ;
Ong, Shyue Ping .
CHEMISTRY OF MATERIALS, 2019, 31 (09) :3564-3572
[15]   The joint automated repository for various integrated simulations (JARVIS) for data-driven materials design [J].
Choudhary, Kamal ;
Garrity, Kevin F. ;
Reid, Andrew C. E. ;
DeCost, Brian ;
Biacchi, Adam J. ;
Hight Walker, Angela R. ;
Trautt, Zachary ;
Hattrick-Simpers, Jason ;
Kusne, A. Gilad ;
Centrone, Andrea ;
Davydov, Albert ;
Jiang, Jie ;
Pachter, Ruth ;
Cheon, Gowoon ;
Reed, Evan ;
Agrawal, Ankit ;
Qian, Xiaofeng ;
Sharma, Vinit ;
Zhuang, Houlong ;
Kalinin, Sergei V. ;
Sumpter, Bobby G. ;
Pilania, Ghanshyam ;
Acar, Pinar ;
Mandal, Subhasish ;
Haule, Kristjan ;
Vanderbilt, David ;
Rabe, Karin ;
Tavazza, Francesca .
NPJ COMPUTATIONAL MATERIALS, 2020, 6 (01)
[16]   Benchmark AFLOW Data Sets for Machine Learning [J].
Clement, Conrad L. ;
Kauwe, Steven K. ;
Sparks, Taylor D. .
INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2020, 9 (02) :153-156
[17]   AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations [J].
Curtarolo, Stefano ;
Setyawan, Wahyu ;
Wang, Shidong ;
Xue, Junkai ;
Yang, Kesong ;
Taylor, Richard H. ;
Nelson, Lance J. ;
Hart, Gus L. W. ;
Sanvito, Stefano ;
Buongiorno-Nardelli, Marco ;
Mingo, Natalio ;
Levy, Ohad .
COMPUTATIONAL MATERIALS SCIENCE, 2012, 58 :227-235
[18]   Comparing molecules and solids across structural and alchemical space [J].
De, Sandip ;
Bartok, Albert P. ;
Csanyi, Gabor ;
Ceriotti, Michele .
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2016, 18 (20) :13754-13769
[19]   The NOMAD laboratory: from data sharing to artificial intelligence [J].
Draxl, Claudia ;
Scheffler, Matthias .
JOURNAL OF PHYSICS-MATERIALS, 2019, 2 (03)
[20]   Dunn A, 2020, NPJ COMPUT MATER, V6, DOI 10.1038/s41524-020-00406-3