Evaluation of performance of machine learning methods in mining structure-property data of halide perovskite materials

被引:11
|
作者
Zhao, Ruoting [1 ,2 ]
Xing, Bangyu [1 ,2 ]
Mu, Huimin [3 ]
Fu, Yuhao [3 ,4 ]
Zhang, Lijun [1 ,2 ,4 ]
机构
[1] Jilin Univ, State Key Lab Integrated Optoelect, Key Lab Automobile Mat MOE,Electron Microscopy Ct, Jilin Prov Int Cooperat Key Lab High Efficiency C, Changchun 130012, Peoples R China
[2] Jilin Univ, Sch Mat Sci & Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Coll Phys, State Key Lab Superhard Mat, Changchun 130012, Peoples R China
[4] Jilin Univ, Int Ctr Computat Method & Software, Changchun 130012, Peoples R China
基金
中国国家自然科学基金;
关键词
machine learning; material informatics; first-principles calculations; halide perovskites; ORGANIC-INORGANIC PEROVSKITES; DESIGN; APPROXIMATION; SINGLE;
D O I
10.1088/1674-1056/ac5d2d
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
With the rapid development of artificial intelligence and machine learning (ML) methods, materials science is rapidly entering the era of data-driven materials informatics. ML models serve as the most crucial component, closely bridging material structure and material properties. There is a considerable difference in the prediction performance of different ML methods for material systems. Herein, we evaluated three categories (linear, kernel, and nonlinear methods) of models, with twelve ML algorithms commonly used in the materials field. In addition, halide perovskite was chosen as an example to evaluate the fitting performance of different models. We constructed a total dataset of 540 halide perovskites and 72 features, with formation energy and bandgap as target properties. We found that different categories of ML models show similar trends for different target properties. Among them, the difference between the models is enormous for the formation energy, with the coefficient of determination (R-2) range 0.69-0.953. The fitting performance between the models is closer for bandgap, with the R-2 range 0.941-0.997. The nonlinear-ensemble model shows the best fitting performance for both the formation energy and the bandgap. It shows that the nonlinear-ensemble model, constructed by combining multiple weak learners, effectively describes the nonlinear relationship between material features and target property. In addition, the extreme gradient boosting decision tree model shows the most superior results among all the models and searches for two new descriptors that are crucial for formation energy and bandgap. Our work provides useful guidance for the selection of effective machine learning methods in the data-mining studies of specific material systems. The dataset that supported the findings of this study is available in Science Data Bank, with the link .
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Machine learning for halide perovskite materials
    Zhang, Lei
    He, Mu
    Shao, Shaofeng
    NANO ENERGY, 2020, 78
  • [2] Method construction of structure-property relationships from data by machine learning assisted mining for materials design applications
    Dai, Dongbo
    Liu, Qing
    Hu, Rui
    Wei, Xiao
    Ding, Guangtai
    Xu, Baoyu
    Xu, Tao
    Zhang, Jincang
    Xu, Yan
    Zhang, Huiran
    MATERIALS & DESIGN, 2020, 196
  • [3] PERFORMANCE PREDICTION OF HALIDE DOUBLE PEROVSKITE MATERIALS BASED ON MACHINE LEARNING
    Zhang Q.
    Xu Z.
    Feng P.
    Tu J.
    Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2024, 45 (04): : 107 - 115
  • [4] A novel structure-property relationship model based on machine learning
    Zhang, Huiran
    Guo, Zhiting
    Hu, Hongqing
    Zhou, Gaofeng
    Liu, Qing
    Xu, Yan
    Qian, Quan
    Dai, Dongbo
    MODELLING AND SIMULATION IN MATERIALS SCIENCE AND ENGINEERING, 2020, 28 (03)
  • [5] Machine learning for structure determination and investigating the structure-property relationships of interfaces
    Oda, Hiromi
    Kiyohara, Shin
    Mizoguchi, Teruyasu
    JOURNAL OF PHYSICS-MATERIALS, 2019, 2 (03):
  • [6] The structure-property relationship of granular materials with different friction coefficients: Insight from machine learning
    Zhang, Yibo
    Zhou, Wei
    Ma, Gang
    Cheng, Ruilin
    Chang, Xiaolin
    EXTREME MECHANICS LETTERS, 2022, 54
  • [7] An efficient machine learning approach to establish structure-property linkages
    Jung, Jaimyun
    Yoon, Jae Ik
    Park, Hyung Keun
    Kim, Jin You
    Kim, Hyoung Seop
    COMPUTATIONAL MATERIALS SCIENCE, 2019, 156 : 17 - 25
  • [8] Developing Catalysts via Structure-Property Relations Discovered by Machine Learning: An Industrial Perspective
    Joshi, Hrishikesh
    Wilde, Nicole
    Asche, Thomas S.
    Wolf, Dorit
    CHEMIE INGENIEUR TECHNIK, 2022, 94 (11) : 1645 - 1654
  • [9] Rationalizing Perovskite Data for Machine Learning and Materials Design
    Xu, Qichen
    Li, Zhenzhu
    Liu, Miao
    Yin, Wan-Jian
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2018, 9 (24): : 6948 - 6954
  • [10] Editorial: Machine Learning and Data Mining in Materials Science
    Huber, Norbert
    Kalidindi, Surya R.
    Klusemann, Benjamin
    Cyron, Christian J.
    FRONTIERS IN MATERIALS, 2020, 7