Machine learning-based surrogate modeling for data-driven optimization: a comparison of subset selection for regression techniques

被引:0
作者
Sun Hye Kim
Fani Boukouvala
机构
[1] Georgia Institute of Technology,School of Chemical & Biomolecular Engineering
来源
Optimization Letters | 2020年 / 14卷
关键词
Machine Learning; Surrogate modeling; Black-box optimization; Data-driven optimization; Subset selection for regression;
D O I
暂无
中图分类号
学科分类号
摘要
Optimization of simulation-based or data-driven systems is a challenging task, which has attracted significant attention in the recent literature. A very efficient approach for optimizing systems without analytical expressions is through fitting surrogate models. Due to their increased flexibility, nonlinear interpolating functions, such as radial basis functions and Kriging, have been predominantly used as surrogates for data-driven optimization; however, these methods lead to complex nonconvex formulations. Alternatively, commonly used regression-based surrogates lead to simpler formulations, but they are less flexible and inaccurate if the form is not known a priori. In this work, we investigate the efficiency of subset selection regression techniques for developing surrogate functions that balance both accuracy and complexity. Subset selection creates sparse regression models by selecting only a subset of original features, which are linearly combined to generate a diverse set of surrogate models. Five different subset selection techniques are compared with commonly used nonlinear interpolating surrogate functions with respect to optimization solution accuracy, computation time, sampling requirements, and model sparsity. Our results indicate that subset selection-based regression functions exhibit promising performance when the dimensionality is low, while interpolation performs better for higher dimensional problems.
引用
收藏
页码:989 / 1010
页数:21
相关论文
共 91 条
[1]  
Boukouvala F(2017)ARGONAUT: AlgoRithms for Global Optimization of coNstrAined grey-box compUTational problems Optim. Lett. 11 895-913
[2]  
Floudas CA(2014)Learning surrogate models for simulation-based optimization AIChE J. 60 2211-2227
[3]  
Cozad A(2014)Simulation optimization: a review of algorithms and applications 4OR 12 301-333
[4]  
Sahinidis NV(2004)Simulation optimization: a comprehensive review on theory and applications IIE Trans. 36 1067-1081
[5]  
Miller DC(2018)Advances in surrogate based modeling, feasibility analysis, and optimization: a review Comput. Chem. Eng. 108 250-267
[6]  
Amaran S(2017)A trust region-based two phase algorithm for constrained black-box and grey-box optimization with infeasible initial point Comput. Chem. Eng. 116 306-321
[7]  
Tekin E(2009)Recent advances in surrogate-based optimization Prog. Aerosp. Sci. 45 50-79
[8]  
Sabuncuoglu I(2010)A method for simulation based optimization using radial basis functions Optim. Eng. 11 501-532
[9]  
Bhosekar A(2011)Dynamic data-driven modeling of pharmaceutical processes Ind. Eng. Chem. Res. 50 6743-6754
[10]  
Ierapetritou M(2018)Optimization of a small-scale LNG supply chain Energy 148 79-89