Learning to steer nonlinear interior-point methods

被引：0

作者：

Kuhlmann, Renke ^{[1
]}

机构：

[1] Ctr Ind Math ZeTeM, Optimizat & Optimal Control, Bibliothekstr 5, D-28359 Bremen, Germany

来源：

EURO JOURNAL ON COMPUTATIONAL OPTIMIZATION | 2019年 / 7卷 / 04期

关键词：

Nonlinear programming; Constrained optimization; Interior-point algorithm; Reinforcement learning; Deep Q-learning; PRIMAL-DUAL METHODS; LINE-SEARCH; ALGORITHM; IMPLEMENTATION; OPTIMIZATION; SOFTWARE;

D O I：

10.1007/s13675-019-00118-4

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

Interior-point or barrier methods handle nonlinear programs by sequentially solving barrier subprograms with a decreasing sequence of barrier parameters. The specific barrier update rule strongly influences the theoretical convergence properties as well as the practical efficiency. While many global and local convergence analyses consider a monotone update that decreases the barrier parameter for every approximately solved subprogram, computational studies show a superior performance of more adaptive strategies. In this paper we interpret the adaptive barrier update as a reinforcement learning task. A deep Q-learning agent is trained by both imitation and random action selection. Numerical results based on an implementation within the nonlinear programming solver WORHP show that the agent successfully learns to steer the barrier parameter and additionally improves WORHP's performance on the CUTEst test set.

引用

页码：381 / 419

页数：39

共 50 条

[31] Extending interior-point methods to nonlinear second-order cone programming: Application to finite-strain elastoplasticity
El Boustani, Chadi
Bleyer, Jeremy
Arquier, Mathieu
Sab, Karam
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2021, 122 (01) : 270 - 293
[32] Interior-point methods on manifolds: theory and applications
Hirai, Hiroshi
Nieuwboer, Harold
Walter, Michael
2023 IEEE 64TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, FOCS, 2023, : 2021 - 2030
[33] Sparsity preserving preconditioners for linear systems in interior-point methods
Drazic, Milan D.
Lazovic, Rade P.
Kovacevic-Vujcic, Vera V.
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2015, 61 (03) : 557 - 570
[34] Interior-point l2-penalty methods for nonlinear programming with strong global convergence properties
Chen, L.
Goldfarb, D.
MATHEMATICAL PROGRAMMING, 2006, 108 (01) : 1 - 36
[35] An arc-search interior-point algorithm for nonlinear constrained optimization
Yang, Yaguang
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2025, 90 (03) : 969 - 995
[36] Solving problems with semidefinite and related constraints using interior-point methods for nonlinear programming
Benson, HY
Vanderbei, RJ
MATHEMATICAL PROGRAMMING, 2003, 95 (02) : 279 - 302
[37] INEXACT INTERIOR-POINT METHOD FOR PDE-CONSTRAINED NONLINEAR OPTIMIZATION
Grote, Marcus J.
Huber, Johannes
Kourounis, Drosos
Schenk, Olaf
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (03) : A1251 - A1276
[38] An Interior-Point l1-Penalty Method for Nonlinear Optimization
Gould, Nick I. M.
Orban, Dominique
Toint, Philippe L.
NUMERICAL ANALYSIS AND OPTIMIZATION, NAO-III, 2015, 134 : 117 - 150
[39] Solving Problems with Semidefinite and Related Constraints Using Interior-Point Methods for Nonlinear Programming
Hande Y. Benson
Robert J. Vanderbei
Mathematical Programming, 2003, 95 : 279 - 302
[40] A filter interior-point algorithm with projected Hessian updating for nonlinear optimization
Gu C.
Zhu D.
Journal of Applied Mathematics and Computing, 2009, 29 (1-2) : 67 - 80

← 1 2 3 4 5 →