PREDICTION OF FORMATION ENERGY USING TWO-STAGE MACHINE LEARNING BASED ON CLUSTERING

被引:2
作者
Fan, Xingyue [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
来源
MATERIALI IN TEHNOLOGIJE | 2021年 / 55卷 / 02期
关键词
ABO3-type perovskites; formation energy; hierarchical clustering; regression model;
D O I
10.17222/mit.2020.174
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The formation energy (Hf) is one of the important properties associated with the thermodynamic stability of ABO3-type perovskite. In this work, two-stage machine learning based on hierarchical clustering and regression was designed for improving the prediction values of the density-functional theory (DFT) Hf of ABO3-type perovskites. A global dataset was clustered into Cluster 1 and Cluster 2 using the CHI (the Calinski-Harabasz index). To compare the prediction performances of Hf, DTR (decision tree regression), GBRT (gradient boosted regression trees), RFR (random forest regression) and ETR (extra tree regression) were applied to build models of Cluster 1, Cluster 2 and the global dataset, respectively. The results showed that all four different regression models of Cluster 1 had a higher R-2, and lower MSE and MAE than those of the global dataset, while the models of Cluster 2 were poorer. Meanwhile, the GBRT model of Cluster 1 achieved a higher R-2 of 0.917, and lower MSE and MAE of 0.033 eV/atom and 0.125 eV/atom. We further validated and compared the generalization ability of the models by predicting the Hf of ABO3-type perovskite previously unseen in the training set. The two-stage machine-learning models proposed here can provide useful guidance for accelerating the exploration of materials with desired properties.
引用
收藏
页码:263 / 268
页数:6
相关论文
共 25 条
[11]   Identifying Pb-free perovskites for solar cells by machine learning [J].
Im, Jino ;
Lee, Seongwon ;
Ko, Tae-Wook ;
Kim, Hyun Woo ;
Hyon, YunKyong ;
Chang, Hyunju .
NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
[12]   Grouping materials and processes for the designer: an application of cluster analysis [J].
Johnson, KW ;
Langdon, PM ;
Ashby, MF .
MATERIALS & DESIGN, 2002, 23 (01) :1-10
[13]   The Open Quantum Materials Database (OQMD): assessing the accuracy of DFT formation energies [J].
Kirklin, Scott ;
Saal, James E. ;
Meredig, Bryce ;
Thompson, Alex ;
Doak, Jeff W. ;
Aykol, Muratahan ;
Ruehl, Stephan ;
Wolverton, Chris .
NPJ COMPUTATIONAL MATERIALS, 2015, 1
[14]   Predicting the thermodynamic stability of perovskite oxides using machine learning models [J].
Li, Wei ;
Jacobs, Ryan ;
Morgan, Dane .
COMPUTATIONAL MATERIALS SCIENCE, 2018, 150 :454-463
[15]   Predicting creep rupture life of Ni-based single crystal superalloys using divide-and-conquer approach based machine learning [J].
Liu, Yue ;
Wu, Junming ;
Wang, Zhichao ;
Lu, Xiao-Gang ;
Avdeev, Maxim ;
Shi, Siqi ;
Wang, Chongyu ;
Yu, Tao .
ACTA MATERIALIA, 2020, 195 :454-467
[16]   Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning [J].
Lu, Shuaihua ;
Zhou, Qionghua ;
Ouyang, Yixin ;
Guo, Yilv ;
Li, Qiang ;
Wang, Jinlan .
NATURE COMMUNICATIONS, 2018, 9
[17]   Comparative analysis of local and consensus quantitative structure-activity relationship approaches for the prediction of bioconcentration factor [J].
Piir, G. ;
Sild, S. ;
Maran, U. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2013, 24 (03) :175-199
[18]  
Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1023/A:1022643204877
[19]   Using Data Mining To Search for Perovskite Materials with Higher Specific Surface Area [J].
Shi, Li ;
Chang, Dongping ;
Ji, Xiaobo ;
Lu, Wencong .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (12) :2420-2427
[20]   Identification and analysis of behavioral phenotypes in autism spectrum disorder via unsupervised machine learning [J].
Stevens, Elizabeth ;
Dixon, Dennis R. ;
Novack, Marlena N. ;
Granpeesheh, Doreen ;
Smith, Tristram ;
Linstead, Erik .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 :29-36