Modeling design and control problems involving neural network surrogates

被引：11

作者：

Yang, Dominic ^{[1
]}

Balaprakash, Prasanna ^{[2
]}

Leyffer, Sven ^{[2
]}

机构：

[1] Univ Calif Los Angeles, Los Angeles, CA 90095 USA

[2] Argonne Natl Lab, Lemont, IL USA

来源：

COMPUTATIONAL OPTIMIZATION AND APPLICATIONS | 2022年 / 83卷 / 03期

关键词：

Mixed-integer programming; Nonlinear programming; Complementarity constraints; Machine learning; Neural networks; MATHEMATICAL PROGRAMS; COMPLEMENTARITY CONSTRAINTS; ALGORITHM;

D O I：

10.1007/s10589-022-00404-9

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We consider nonlinear optimization problems that involve surrogate models represented by neural networks. We demonstrate first how to directly embed neural network evaluation into optimization models, highlight a difficulty with this approach that can prevent convergence, and then characterize stationarity of such models. We then present two alternative formulations of these problems in the specific case of feedforward neural networks with ReLU activation: as a mixed-integer optimization problem and as a mathematical program with complementarity constraints. For the latter formulation we prove that stationarity at a point for this problem corresponds to stationarity of the embedded formulation. Each of these formulations may be solved with state-of-the-art optimization methods, and we show how to obtain good initial feasible solutions for these methods. We compare our formulations on three practical applications arising in the design and control of combustion engines, in the generation of adversarial attacks on classifier networks, and in the determination of optimal flows in an oil well network.

引用

页码：759 / 800

页数：42

共 62 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2] MaLTESE: Large-Scale Simulation-Driven Machine Learning for Transient Driving Cycles [J].

Aithal, Shashi M. ;

Balaprakash, Prasanna .

HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2019, 2019, 11501 :186-205

[3] Strong Mixed-Integer Programming Formulations for Trained Neural Networks [J].

Anderson, Ross ;

Huchette, Joey ;

Tjandraatmadja, Christian ;

Vielma, Juan Pablo .

INTEGER PROGRAMMING AND COMBINATORIAL OPTIMIZATION, IPCO 2019, 2019, 11480 :27-42

[4]

[Anonymous], 2012, Gurobi optimizer reference manual

[5] MPEC problem formulations and solution strategies with chemical engineering applications [J].

Baumrucker, B. T. ;

Renfro, J. G. ;

Biegler, L. T. .

COMPUTERS & CHEMICAL ENGINEERING, 2008, 32 (12) :2903-2913

[6]

Belotti, 2020, TECHNICAL REPORT FIC

[7] Network Models for Multiobjective Discrete Optimization [J].

Bergman, David ;

Bodur, Merve ;

Cardonha, Carlos ;

Cire, Andre A. .

INFORMS JOURNAL ON COMPUTING, 2022, 34 (02) :990-1005

[8] Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning [J].

Bolte, Jerome ;

Pauwels, Edouard .

MATHEMATICAL PROGRAMMING, 2021, 188 (01) :19-51

[9]

Bonami Pierre., 2007, NUMER MATH, V4, P1

[10]

Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)

← 1 2 3 4 5 6 7 →