Classification enhanced machine learning model for energetic stability of binary compounds

被引:0
作者
Liu, Y. K. [1 ,2 ,3 ]
Liu, Z. R. [1 ,2 ,3 ]
Xu, T. F. [1 ,2 ,3 ]
Legut, D. [4 ,5 ]
Yin, X. [6 ]
Zhang, R. F. [1 ,2 ,3 ]
机构
[1] Beihang Univ, Sch Mat Sci & Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Int Res Inst Multidisciplinary Sci, Ctr Integrated Computat Mat Engn, Key Lab High Temp Struct Mat, Beijing 100191, Peoples R China
[3] Beihang Univ, Key Lab High Temp Struct Mat & Coatings Technol, Minist Ind & Informat Technol, Beijing 100191, Peoples R China
[4] VSB Tech Univ Ostrava, IT4Innovat, 17 Listopadu 2172-15, Ostrava 70800, Czech Republic
[5] Charles Univ Prague, Fac Math & Phys, Dept Condensed Matter Phys, Ke Karlovu 3, Prague 2, Czech Republic
[6] Nucl Power Inst China, Natl Key Lab Nucl Reactor Technol, Chengdu 610041, Peoples R China
关键词
Formation enthalpy; Binary compounds; Miedema theory; Machine learning; TRANSITION-METALS; CHEMICAL-BOND; ELECTRONEGATIVITY; FRAMEWORK; ALLOYS;
D O I
10.1016/j.commatsci.2024.113277
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
As contemporary computational technologies and machine learning methodologies rapidly evolve, machine learning (ML) models for predicting formation enthalpies of materials exhibited convincible numerical precision and remarkable predictive efficiency, thus establishing a solid foundation for materials thermodynamic design. Despite achieving numerically high global probability accuracy, current ML models for formation enthalpy nonetheless exhibit suboptimal local accuracy within specific physical domain, which can be attributed to the misalignment between the physical constraints of chemical bonds and the critical descriptors capturing classspecific traits. Herein, we propose a novel approach to improve the local precision of the ML model for predicting formation enthalpy by utilizing Miedema theory-based classification, which segments data into distinct categories according to the electronegativity difference, electron density discontinuity and atomic size difference. Utilizing ML algorithms to build surrogate models guided by the classification strategy significantly improves the local predictive accuracy of formation enthalpy for specific binary compounds, significantly raising the R2 value from 0.4-0.9 to 0.8-0.9 compared to an unclassified method. Furthermore, feature importance analysis demonstrates that the pivotal factors for each category vary in some manner, highlighting the insufficiency of a sole ML model in classifying large-dimensional data, which can be addressed by adopting a physicsinformed classification strategy. Our results suggest that employing physical-informed classification scheme into ML equips the models with broad applicability and local accuracy, which also shed light for other material properties predication.
引用
收藏
页数:10
相关论文
共 68 条
[1]   Perspective: Materials informatics and big data: Realization of the "fourth paradigm" of science in materials science [J].
Agrawal, Ankit ;
Choudhary, Alok .
APL MATERIALS, 2016, 4 (05)
[2]   A SCALE OF ELECTRONEGATIVITY BASED ON ELECTROSTATIC FORCE [J].
ALLRED, AL ;
ROCHOW, EG .
JOURNAL OF INORGANIC & NUCLEAR CHEMISTRY, 1958, 5 (04) :264-268
[3]  
Bakker H., 1998, Trans. tech. publications
[4]   Exchange-correlation functionals for band gaps of solids: benchmark, reparametrization and machine learning [J].
Borlido, Pedro ;
Schmidt, Jonathan ;
Huran, Ahmad W. ;
Tran, Fabien ;
Marques, Miguel A. L. ;
Botti, Silvana .
NPJ COMPUTATIONAL MATERIALS, 2020, 6 (01)
[5]   Phase prediction and effect of intrinsic residual strain on phase stability in high-entropy alloys with machine learning [J].
Chang, Huinan ;
Tao, Yiwen ;
Liaw, Peter K. ;
Ren, Jingli .
JOURNAL OF ALLOYS AND COMPOUNDS, 2022, 921
[6]   Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals [J].
Chen, Chi ;
Ye, Weike ;
Zuo, Yunxing ;
Zheng, Chen ;
Ong, Shyue Ping .
CHEMISTRY OF MATERIALS, 2019, 31 (09) :3564-3572
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]   AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations [J].
Curtarolo, Stefano ;
Setyawan, Wahyu ;
Wang, Shidong ;
Xue, Junkai ;
Yang, Kesong ;
Taylor, Richard H. ;
Nelson, Lance J. ;
Hart, Gus L. W. ;
Sanvito, Stefano ;
Buongiorno-Nardelli, Marco ;
Mingo, Natalio ;
Levy, Ohad .
COMPUTATIONAL MATERIALS SCIENCE, 2012, 58 :227-235
[9]   A Statistical Learning Framework for Materials Science: Application to Elastic Moduli of k-nary Inorganic Polycrystalline Compounds [J].
de Jong, Maarten ;
Chen, Wei ;
Notestine, Randy ;
Persson, Kristin ;
Ceder, Gerbrand ;
Jain, Anubhav ;
Asta, Mark ;
Gamst, Anthony .
SCIENTIFIC REPORTS, 2016, 6
[10]   THEORY OF BONDING OF TRANSITION-METALS TO NON-TRANSITION METALS [J].
GELATT, CD ;
WILLIAMS, AR ;
MORUZZI, VL .
PHYSICAL REVIEW B, 1983, 27 (04) :2005-2013