A machine-learning-based alloy design platform that enables both forward and inverse predictions for thermo-mechanically controlled processed (TMCP) steel alloys

被引:38
作者
Lee, Jin-Woong [1 ]
Park, Chaewon [1 ]
Do Lee, Byung [1 ]
Park, Joonseo [1 ]
Goo, Nam Hoon [2 ]
Sohn, Kee-Sun [1 ]
机构
[1] Sejong Univ, Nanotechnol & Adv Mat Engn, 209 Neungdong Ro, Seoul 143747, South Korea
[2] Hyundai Steel DangJin Works, Adv Res Team, Dangjin 31719, Chungnam, South Korea
基金
新加坡国家研究基金会;
关键词
HOLOGRAPHIC RESEARCH STRATEGY; NONDOMINATED SORTING APPROACH; ARTIFICIAL NEURAL-NETWORKS; HIGH ENTROPY ALLOYS; DISORDERED COMPOUND; VARIABLE SELECTION; GENETIC ALGORITHM; LOW-SYMMETRY; BAND-GAP; OPTIMIZATION;
D O I
10.1038/s41598-021-90237-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting mechanical properties such as yield strength (YS) and ultimate tensile strength (UTS) is an intricate undertaking in practice, notwithstanding a plethora of well-established theoretical and empirical models. A data-driven approach should be a fundamental exercise when making YS/UTS predictions. For this study, we collected 16 descriptors (attributes) that implicate the compositional and processing information and the corresponding YS/UTS values for 5473 thermo-mechanically controlled processed (TMCP) steel alloys. We set up an integrated machine-learning (ML) platform consisting of 16 ML algorithms to predict the YS/UTS based on the descriptors. The integrated ML platform involved regularization-based linear regression algorithms, ensemble ML algorithms, and some non-linear ML algorithms. Despite the dirty nature of most real-world industry data, we obtained acceptable holdout dataset test results such as R-2>0.6 and MSE<0.01 for seven non-linear ML algorithms. The seven fully trained non-linear ML models were used for the ensuing 'inverse design (prediction)' based on an elitist-reinforced, non-dominated sorting genetic algorithm (NSGA-II). The NSGA-II enabled us to predict solutions that exhibit desirable YS/UTS values for each ML algorithm. In addition, the NSGA-II-driven solutions in the 16-dimensional input feature space were visualized using holographic research strategy (HRS) in order to systematically compare and analyze the inverse-predicted solutions for each ML algorithm.
引用
收藏
页数:18
相关论文
共 75 条
[1]   RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[2]   AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION [J].
ALTMAN, NS .
AMERICAN STATISTICIAN, 1992, 46 (03) :175-185
[3]   THERMO-CALC & DICTRA, computational tools for materials science [J].
Andersson, JO ;
Helander, T ;
Höglund, LH ;
Shi, PF ;
Sundman, B .
CALPHAD-COMPUTER COUPLING OF PHASE DIAGRAMS AND THERMOCHEMISTRY, 2002, 26 (02) :273-312
[4]   Using genetic algorithm and artificial neural network analyses to design an Al-Si casting alloy of minimum porosity [J].
Anijdan, SHM ;
Bahrami, A ;
Hosseini, HRM ;
Shafyei, A .
MATERIALS & DESIGN, 2006, 27 (07) :605-609
[5]  
[Anonymous], 1992, BAYESIAN INFERENCE S, DOI [10.1002/9781118033197, DOI 10.1002/9781118033197.CH4]
[6]  
[Anonymous], 1990, SPLINE MODELS OBSERV, DOI [DOI 10.1137/1.9781611970128, 10.1137/1.9781611970128]
[7]  
[Anonymous], 2008, Advances in neural information processing systems
[8]  
Ballard D. H., 1987, Aaai, V647, P279
[9]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[10]  
Chen T, 2016, KDD16 P 22 ACM, DOI DOI 10.1145/2939672.2939785