Evaluation of machine learning interpolation techniques for prediction of physical properties

被引:44
作者
Belisle, Eve [1 ]
Huang, Zi [1 ]
Le Digabel, Sebastien [3 ,4 ]
Gheribi, Aimen E. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Ecole Polytech, Dept Chem Engn, CRCT Ctr Res Computat Thermochem, Montreal, PQ H3C 3A7, Canada
[3] Ecole Polytech, Gerad, Montreal, PQ H3C 3A7, Canada
[4] Ecole Polytech, Dept Math & Ind Engn, Montreal, PQ H3C 3A7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Superalloys; Database; Gaussian process; Neural network; Quadratic regression; Physical properties; Computational dependence; HETEROGENEOUS MARTENSITIC NUCLEATION; LINEAR INTERPOLATION; START TEMPERATURE; NEURAL-NETWORKS; REGRESSION; MODELS; SOFTWARE; KINETICS; DESIGN;
D O I
10.1016/j.commatsci.2014.10.032
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A knowledge of the physical properties of materials as a function of temperature, composition, applied external stresses, etc. is an important consideration in materials and process design. For new systems, such properties may be unknown and hard to measure or estimate from numerical simulations such as molecular dynamics. Engineers rely on machine learning to employ existing data in order to predict properties for new systems. Several techniques are currently used for such purposes. These include neural network, polynomial interpolation and Gaussian processes as well as the more recent dynamic trees and scalable Gaussian processes. In this paper we compare these approaches for three sets of materials sciences data: molar volume, electrical conductivity and Martensite start temperature. We make recommendations depending on the nature of the data. We demonstrate that a thorough knowledge of the problem beforehand is critical in selecting the most successful machine learning technique. Our findings show that the Gaussian process regression technique gives very good predictions for all three sets of tested data. Typically, Gaussian process is very slow with a computational complexity of typically n(3) where n is the number of data points. In this paper, we found that the scalable Gaussian process approach was able to maintain the high accuracy of the predictions while improving speed considerably, make on-line learning possible. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:170 / 177
页数:8
相关论文
共 49 条
[11]   Dependence of martensite start temperature on fine austenite grain size [J].
Garcia-Junceda, A. ;
Capdevila, C. ;
Caballero, F. G. ;
de Andres, C. Garcia .
SCRIPTA MATERIALIA, 2008, 58 (02) :134-137
[12]   Calculating optimal conditions for alloy and process design using thermodynamic and property databases, the Fact Sage software and the Mesh Adaptive Direct Search algorithm [J].
Gheribi, A. E. ;
Audet, C. ;
Le Digabel, S. ;
Belisle, E. ;
Bale, C. W. ;
Pelton, A. D. .
CALPHAD-COMPUTER COUPLING OF PHASE DIAGRAMS AND THERMOCHEMISTRY, 2012, 36 :135-143
[13]   Identifying optimal conditions for magnesium based alloy design using the Mesh Adaptive Direct Search algorithm [J].
Gheribi, Aimen E. ;
Le Digabel, Sebastien ;
Audet, Charles ;
Chartrand, Patrice .
THERMOCHIMICA ACTA, 2013, 559 :107-110
[14]   Calculating all local minima on liquidus surfaces using the FactSage software and databases and the Mesh Adaptive Direct Search algorithm [J].
Gheribi, Aimen E. ;
Robelin, Christian ;
Le Digabel, Sebastien ;
Audet, Charles ;
Pelton, Arthur D. .
JOURNAL OF CHEMICAL THERMODYNAMICS, 2011, 43 (09) :1323-1330
[15]   KINETICS OF FCC-]BCC HETEROGENEOUS MARTENSITIC NUCLEATION .1. THE CRITICAL DRIVING-FORCE FOR ATHERMAL NUCLEATION [J].
GHOSH, G ;
OLSON, GB .
ACTA METALLURGICA ET MATERIALIA, 1994, 42 (10) :3361-3370
[16]   KINETICS OF FCC-]BCC HETEROGENEOUS MARTENSITIC NUCLEATION .2. THERMAL-ACTIVATION [J].
GHOSH, G ;
OLSON, GB .
ACTA METALLURGICA ET MATERIALIA, 1994, 42 (10) :3371-3379
[17]   Accuracy of quadratic versus linear interpolation in noninvasive Electrocardiographic Imaging (ECGI) [J].
Ghosh, S ;
Rudy, Y .
ANNALS OF BIOMEDICAL ENGINEERING, 2005, 33 (09) :1187-1201
[18]  
GIBBS MN, STAT COMPUT UNPUB
[19]  
GRAMACY R, 2010, DYNATREE AN R PACKAG
[20]   Bayesian Treed Gaussian Process Models With an Application to Computer Modeling [J].
Gramacy, Robert B. ;
Lee, Herbert K. H. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (483) :1119-1130