A J-symmetric quasi-newton method for minimax problems

被引:1
作者
Asl, Azam [1 ]
Lu, Haihao [1 ]
Yang, Jinwen [2 ]
机构
[1] Univ Chicago Booth Sch Business, Chicago, IL 60611 USA
[2] Univ Chicago, Dept Stat, Chicago, IL USA
关键词
Minimax optimization; Quasi-Newton method; J-symmetric; Superlinear convergence; Trust region; PROXIMAL POINT ALGORITHM; SUPERLINEAR CONVERGENCE; GLOBAL CONVERGENCE; MONOTONE-OPERATORS;
D O I
10.1007/s10107-023-01957-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Minimax problems have gained tremendous attentions across the optimization and machine learning community recently. In this paper, we introduce a new quasi-Newton method for the minimax problems, which we call J-symmetric quasi-Newton method. The method is obtained by exploiting the J-symmetric structure of the second-order derivative of the objective function in minimax problem. We show that the Hessian estimation (as well as its inverse) can be updated by a rank-2 operation, and it turns out that the update rule is a natural generalization of the classic Powell symmetric Broyden method from minimization problems to minimax problems. In theory, we show that our proposed quasi-Newton algorithm enjoys local Q-superlinear convergence to a desirable solution under standard regularity conditions. Furthermore, we introduce a trust-region variant of the algorithm that enjoys global R-superlinear convergence. Finally, we present numerical experiments that verify our theory and show the effectiveness of our proposed algorithms compared to Broyden's method and the extragradient method on three classes of minimax problems.
引用
收藏
页码:207 / 254
页数:48
相关论文
共 64 条
[51]  
Ortega J.M., 1970, ITERATIVE SOLUTION N
[52]  
Osborne M. J., 1994, A course in game theory
[53]   FAST EXACT MULTIPLICATION BY THE HESSIAN [J].
PEARLMUTTER, BA .
NEURAL COMPUTATION, 1994, 6 (01) :147-160
[54]  
Powell M. J., 1970, NONLINEAR PROGRAMMIN, P31, DOI DOI 10.1016/B978-0-12-597050-1.50006-3
[55]  
Powell M.J., 1975, SIAM, P53
[56]  
Powell MJD, 1970, Numerical Methods for Nonlinear Algebraic Equations, P87
[57]   MONOTONE OPERATORS AND PROXIMAL POINT ALGORITHM [J].
ROCKAFELLAR, RT .
SIAM JOURNAL ON CONTROL, 1976, 14 (05) :877-898
[58]   Fast curvature matrix-vector products for second-order gradient descent [J].
Schraudolph, NN .
NEURAL COMPUTATION, 2002, 14 (07) :1723-1738
[59]   CONDITIONING OF QUASI-NEWTON METHODS FOR FUNCTION MINIMIZATION [J].
SHANNO, DF .
MATHEMATICS OF COMPUTATION, 1970, 24 (111) :647-&
[60]  
Sidi A., 2003, J COMPUT APPL MATH, V2