Learning to steer nonlinear interior-point methods

Cited by: 0
Author
Kuhlmann, Renke [1 ]
Affiliation
[1] Center for Industrial Mathematics (ZeTeM), Optimization & Optimal Control, Bibliothekstr. 5, D-28359 Bremen, Germany
Keywords
Nonlinear programming; Constrained optimization; Interior-point algorithm; Reinforcement learning; Deep Q-learning; Primal-dual methods; Line search; Algorithm; Implementation; Optimization; Software
DOI
10.1007/s13675-019-00118-4
Chinese Library Classification
C93 (Management); O22 (Operations Research);
Discipline classification codes
070105; 12; 1201; 1202; 120202;
Abstract
Interior-point or barrier methods handle nonlinear programs by sequentially solving barrier subprograms with a decreasing sequence of barrier parameters. The specific barrier update rule strongly influences the theoretical convergence properties as well as the practical efficiency. While many global and local convergence analyses consider a monotone update that decreases the barrier parameter for every approximately solved subprogram, computational studies show a superior performance of more adaptive strategies. In this paper we interpret the adaptive barrier update as a reinforcement learning task. A deep Q-learning agent is trained by both imitation and random action selection. Numerical results based on an implementation within the nonlinear programming solver WORHP show that the agent successfully learns to steer the barrier parameter and additionally improves WORHP's performance on the CUTEst test set.
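The abstract's starting point is the classical barrier loop: solve a sequence of barrier subprograms while driving the barrier parameter to zero, where the update rule for that parameter is exactly what the paper proposes to learn. A minimal sketch of this loop on a toy one-dimensional problem is given below; the fixed monotone update mu ← sigma·mu stands in for the adaptive, agent-chosen rule studied in the paper, and the problem, function names, and parameter values are illustrative assumptions, not taken from the paper or from WORHP.

```python
def solve_barrier_subproblem(mu, x0, tol=1e-10, max_iter=50):
    """Approximately minimize phi(x) = (x - 2)**2 - mu*log(x) over x > 0
    by Newton's method (a toy stand-in for one barrier subprogram)."""
    x = x0
    for _ in range(max_iter):
        grad = 2.0 * (x - 2.0) - mu / x          # phi'(x)
        hess = 2.0 + mu / x**2                    # phi''(x) > 0, so Newton is safe
        x_new = x - grad / hess
        if x_new <= 0.0:                          # fall back to stay in the interior
            x_new = 0.5 * x
        x = x_new
        if abs(grad) < tol:
            break
    return x


def barrier_method(mu0=1.0, sigma=0.2, mu_min=1e-9, x0=1.0):
    """Barrier loop for min (x - 2)**2 s.t. x >= 0, whose solution is x* = 2.
    The monotone rule mu <- sigma * mu is applied after every subproblem;
    the paper replaces this fixed rule with a deep Q-learning agent."""
    mu, x = mu0, x0
    while mu > mu_min:
        x = solve_barrier_subproblem(mu, x)       # warm-start from previous iterate
        mu *= sigma                                # monotone barrier update
    return x
```

With a learned update, the agent would instead observe the state of the current iterate (e.g. infeasibility and optimality measures) and select the next mu from a discrete set of actions, which is the reinforcement-learning formulation the abstract describes.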
Pages: 381-419 (39 pages)