Learning to steer nonlinear interior-point methods

被引:0
|
作者
Kuhlmann, Renke [1 ]
机构
[1] Ctr Ind Math ZeTeM, Optimizat & Optimal Control, Bibliothekstr 5, D-28359 Bremen, Germany
关键词
Nonlinear programming; Constrained optimization; Interior-point algorithm; Reinforcement learning; Deep Q-learning; PRIMAL-DUAL METHODS; LINE-SEARCH; ALGORITHM; IMPLEMENTATION; OPTIMIZATION; SOFTWARE;
D O I
10.1007/s13675-019-00118-4
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Interior-point or barrier methods handle nonlinear programs by sequentially solving barrier subprograms with a decreasing sequence of barrier parameters. The specific barrier update rule strongly influences the theoretical convergence properties as well as the practical efficiency. While many global and local convergence analyses consider a monotone update that decreases the barrier parameter for every approximately solved subprogram, computational studies show a superior performance of more adaptive strategies. In this paper we interpret the adaptive barrier update as a reinforcement learning task. A deep Q-learning agent is trained by both imitation and random action selection. Numerical results based on an implementation within the nonlinear programming solver WORHP show that the agent successfully learns to steer the barrier parameter and additionally improves WORHP's performance on the CUTEst test set.
引用
收藏
页码:381 / 419
页数:39
相关论文
共 50 条
  • [31] An easy way to teach interior-point methods
    Terlaky, T
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 130 (01) : 1 - 19
  • [32] Interior-point methods on manifolds: theory and applications
    Hirai, Hiroshi
    Nieuwboer, Harold
    Walter, Michael
    2023 IEEE 64TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, FOCS, 2023, : 2021 - 2030
  • [33] Stabilization of Interior-Point Methods for Linear Programming
    Vera V. Kovacevic-Vujcic
    Miroslav D. Asic
    Computational Optimization and Applications, 1999, 14 : 331 - 346
  • [34] Solving linear systems in interior-point methods
    Seol, T
    Park, S
    COMPUTERS & OPERATIONS RESEARCH, 2002, 29 (04) : 317 - 326
  • [35] Optimal truss design by interior-point methods
    Jarre, F
    Kocvara, M
    Zowe, J
    SIAM JOURNAL ON OPTIMIZATION, 1998, 8 (04) : 1084 - 1107
  • [36] Stabilization of interior-point methods for linear programming
    Kovacevic-Vujcic, VV
    Asic, MD
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 14 (03) : 331 - 346
  • [37] FAST INTERIOR-POINT METHODS FOR BIPARTITE MATCHING
    GROVER, LK
    SIAM JOURNAL ON OPTIMIZATION, 1995, 5 (04) : 740 - 769
  • [38] Interior-point methods for magnitude filter design
    Alkire, B
    Vandenberghe, L
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3821 - 3824
  • [39] Proximal Methods in View of Interior-Point Strategies
    A. Kaplan
    R. Tichatschke
    Journal of Optimization Theory and Applications, 1998, 98 : 399 - 429
  • [40] INTERIOR-POINT METHODS FOR CONVEX-PROGRAMMING
    JARRE, F
    APPLIED MATHEMATICS AND OPTIMIZATION, 1992, 26 (03): : 287 - 311