A Globalization of L-BFGS and the Barzilai-Borwein Method for Nonconvex Unconstrained Optimization

被引:0
|
作者
Mannel, Florian [1 ]
机构
[1] Univ Lubeck, Inst Math & Image Comp, Maria Goeppert Str 3, D-23562 Lubeck, Germany
关键词
L-BFGS; Barzilai-Borwein methods; Cautious updates; Nonconvex optimization; Global convergence; Linear convergence; Hilbert space; QUASI-NEWTON METHODS; LIMITED-MEMORY; GLOBAL CONVERGENCE; LINEAR CONVERGENCE; ALGORITHMS; MINIMIZATION; CONSTRAINTS; MATRICES; FAMILY;
D O I
10.1007/s10957-024-02565-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We present a modified limited memory BFGS (L-BFGS) method that converges globally and linearly for nonconvex objective functions. Its distinguishing feature is that it turns into L-BFGS if the iterates cluster at a point near which the objective is strongly convex with Lipschitz gradients, thereby inheriting the outstanding effectiveness of the classical method. These strong convergence guarantees are enabled by a novel form of cautious updating, where, among others, it is decided anew in each iteration which of the stored pairs are used for updating and which ones are skipped. In particular, this yields the first modification of cautious updating for which all cluster points are stationary while the spectrum of the L-BFGS operator is not permanently restricted, and this holds without Lipschitz continuity of the gradient. In fact, for Wolfe-Powell line searches we show that continuity of the gradient is sufficient for global convergence, which extends to other descent methods. Since we allow the memory size to be zero in the globalized L-BFGS method, we also obtain a new globalization of the Barzilai-Borwein spectral gradient (BB) method. The convergence analysis is developed in Hilbert space under comparably weak assumptions and covers Armijo and Wolfe-Powell line searches. We illustrate the theoretical findings with numerical experiments. The experiments indicate that if one of the parameters of the cautious updating is chosen sufficiently small, then the modified method agrees entirely with L-BFGS/BB. We also discuss this in the theoretical part. An implementation of the new method is available on arXiv.
引用
收藏
页数:34
相关论文
共 50 条
  • [21] A new adaptive Barzilai and Borwein method for unconstrained optimization
    Hongwei Liu
    Zexian Liu
    Xiaoliang Dong
    Optimization Letters, 2018, 12 : 845 - 873
  • [22] A modified conjugate gradient algorithm with cyclic Barzilai-Borwein steplength for unconstrained optimization
    Xiao, Yunhai
    Song, Huina
    Wang, Zhiguo
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2012, 236 (13) : 3101 - 3110
  • [23] A new adaptive Barzilai and Borwein method for unconstrained optimization
    Liu, Hongwei
    Liu, Zexian
    Dong, Xiaoliang
    OPTIMIZATION LETTERS, 2018, 12 (04) : 845 - 873
  • [24] A Barzilai-Borwein conjugate gradient method
    YuHong Dai
    CaiXia Kou
    Science China Mathematics, 2016, 59 : 1511 - 1524
  • [25] A Barzilai-Borwein conjugate gradient method
    DAI YuHong
    KOU CaiXia
    ScienceChina(Mathematics), 2016, 59 (08) : 1511 - 1524
  • [26] A Barzilai-Borwein conjugate gradient method
    Dai YuHong
    Kou CaiXia
    SCIENCE CHINA-MATHEMATICS, 2016, 59 (08) : 1511 - 1524
  • [27] An Efficient Barzilai–Borwein Conjugate Gradient Method for Unconstrained Optimization
    Hongwei Liu
    Zexian Liu
    Journal of Optimization Theory and Applications, 2019, 180 : 879 - 906
  • [28] The Barzilai-Borwein Method for distributed optimization over unbalanced directed networks
    Hu, Jinhui
    Chen, Xin
    Zheng, Lifeng
    Zhang, Ling
    Li, Huaqing
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 99
  • [29] A New Analysis on the Barzilai-Borwein Gradient Method
    Dai Y.-H.
    Dai, Y.-H. (dyh@lsec.cc.ac.cn), 1600, Springer Science and Business Media Deutschland GmbH (01): : 187 - 198
  • [30] A Method for Stochastic L-BFGS Optimization
    Qi, Peng
    Zhou, Wei
    Han, Jizhong
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 156 - 160