Improving prediction accuracy of high-performance materials via modified machine learning strategy

被引:19
作者
Yong, Wei [1 ]
Zhang, Hongtao [1 ,2 ]
Fu, Huadong [1 ,2 ]
Zhu, Yaliang [1 ]
He, Jie [3 ]
Xie, Jianxin [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Beijing Adv Innovat Ctr Mat Genome Engn, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Key Lab Adv Mat Proc MOE, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Machine learning; Extrapolation strategy; Cross validation; Evaluation Strategy; BULK METALLIC GLASSES; HIGH-ENTROPY ALLOYS; STRENGTH; DESIGN; CONDUCTIVITY; DISCOVERY; SELECTION; TERNARY;
D O I
10.1016/j.commatsci.2021.111181
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In general, machine learning models trained by traditional cross validation evaluation strategies always have excellent interpolation ability in the known data space, but weak extrapolation ability in the unknown data space. However, in the field of material research, discovering new materials with higher performance always needs high prediction accuracy on the unknown data space outside the existing data range. In order to solve this challenge, we propose a modified evaluation strategy of machine learning modeling to promote the extrapolation ability by sorting the existing data according to the property value before the training/testing data set partition. It is demonstrated that the extrapolation and prediction abilities of machine learning model can be significantly improved by using this strategy. Machine learning models with lower prediction error and higher efficiency of discovering new materials are obtained on three materials data sets (bulk metallic glasses dataset, high-entropy alloys dataset, and piezoelectric materials dataset). The model errors are reduced from 23.84%, 9.79% and 14.47% of the traditional strategy to 2.44%, 7.67% and 8.07% of the modified strategy, respectively. It is indicated that this strategy can provide a solution for machine learning modeling in material field with requirements of discovering new materials with higher performance.
引用
收藏
页数:8
相关论文
共 44 条
  • [1] Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression
    An, Senjian
    Liu, Wanquan
    Venkatesh, Svetha
    [J]. PATTERN RECOGNITION, 2007, 40 (08) : 2154 - 2162
  • [2] Adaptive machine learning for efficient materials design
    Balachandran, Prasanna V.
    [J]. MRS BULLETIN, 2020, 45 (07) : 579 - 586
  • [3] Machine learning for molecular and materials science
    Butler, Keith T.
    Davies, Daniel W.
    Cartwright, Hugh
    Isayev, Olexandr
    Walsh, Aron
    [J]. NATURE, 2018, 559 (7715) : 547 - 555
  • [4] Bulk metallic glasses in the Zr-Al-Ni-Cu system
    Chen, W
    Wang, Y
    Qiang, J
    Dong, C
    [J]. ACTA MATERIALIA, 2003, 51 (07) : 1899 - 1907
  • [5] Ferroelectric, dielectric and piezoelectric properties of ferroelectric thin films and ceramics
    Damjanovic, D
    [J]. REPORTS ON PROGRESS IN PHYSICS, 1998, 61 (09) : 1267 - 1324
  • [6] Designing High Dielectric Permittivity Material in Barium Titanate
    Gao, Jinghui
    Liu, Yongbin
    Wang, Yan
    Hu, Xinghao
    Yan, Wenbo
    Ke, Xiaoqin
    Zhong, Lisheng
    He, Yuting
    Ren, Xiaobing
    [J]. JOURNAL OF PHYSICAL CHEMISTRY C, 2017, 121 (24) : 13106 - 13113
  • [7] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [8] Ferroelectric ceramics: History and technology
    Haertling, GH
    [J]. JOURNAL OF THE AMERICAN CERAMIC SOCIETY, 1999, 82 (04) : 797 - 818
  • [9] Data-Driven Materials Science: Status, Challenges, and Perspectives
    Himanen, Lauri
    Geurts, Amber
    Foster, Adam Stuart
    Rinke, Patrick
    [J]. ADVANCED SCIENCE, 2019, 6 (21)
  • [10] Machine-learning phase prediction of high-entropy alloys
    Huang, Wenjiang
    Martin, Pedro
    Zhuang, Houlong L.
    [J]. ACTA MATERIALIA, 2019, 169 : 225 - 236