Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods

被引:0
作者
Lin, Dachao [1 ]
Ye, Haishan [2 ]
Zhang, Zhihua [3 ]
机构
[1] Peking Univ, Acad Adv Interdisciplinary Studies, Beijing, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Management, Xian, Peoples R China
[3] Peking Univ, Sch Math Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
quasi-Newton methods; superlinear convergence; local convergence; rate of convergence; Broyden family; SR1; BFGS; DFP; SUPERLINEAR CONVERGENCE; GLOBAL CONVERGENCE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Optimization is important in machine learning problems, and quasi-Newton methods have a reputation as the most efficient numerical methods for smooth unconstrained optimization. In this paper, we study the explicit superlinear convergence rates of quasi-Newton methods and address two open problems mentioned by Rodomanov and Nesterov (2021b). First, we extend Rodomanov and Nesterov (2021b)'s results to random quasi-Newton methods, which include common DFP, BFGS, SR1 methods. Such random methods employ a random direction for updating the approximate Hessian matrix in each iteration. Second, we focus on the specific quasi-Newton methods: SR1 and BFGS methods. We provide improved versions of greedy and random methods with provable better explicit (local) superlinear convergence rates. Our analysis is closely related to the approximation of a given Hessian matrix, unconstrained quadratic objective, as well as the general strongly convex, smooth, and strongly self-concordant functions.
引用
收藏
页数:40
相关论文
共 34 条
[1]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[2]  
Bordes A, 2009, J MACH LEARN RES, V10, P1737
[3]   On-line learning for very large data sets [J].
Bottou, U ;
Le Cun, Y .
APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2005, 21 (02) :137-151
[4]  
Broyden C. G., 1970, Journal of the Institute of Mathematics and Its Applications, V6, P222
[5]  
Broyden C. G., 1973, Journal of the Institute of Mathematics and Its Applications, V12, P223
[6]   QUASI-NEWTON METHODS AND THEIR APPLICATION TO FUNCTION MINIMISATION [J].
BROYDEN, CG .
MATHEMATICS OF COMPUTATION, 1967, 21 (99) :368-&
[7]   GLOBAL CONVERGENCE OF A CLASS OF QUASI-NEWTON METHODS ON CONVEX PROBLEMS [J].
BYRD, RH ;
NOCEDAL, J ;
YUAN, YX .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1987, 24 (05) :1171-1190
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]   VARIABLE METRIC METHOD FOR MINIMIZATION [J].
Davidon, William C. .
SIAM JOURNAL ON OPTIMIZATION, 1991, 1 (01) :1-17
[10]  
DENNIS JE, 1974, MATH COMPUT, V28, P549, DOI 10.1090/S0025-5718-1974-0343581-1