Detecting negative eigenvalues of exact and approximate Hessian matrices in optimization

被引:1
作者
Hare, Warren [1 ]
Royer, Clement W. [2 ]
机构
[1] Univ British Columbia, Dept Math, Okanagan Campus, Kelowna, BC V1V1V7, Canada
[2] Univ Paris Dauphine PSL, LAMSADE, CNRS, Pl Marechal de Lattre de Tassigny, F-75016 Paris, France
基金
加拿大自然科学与工程研究理事会;
关键词
Negative curvature; Blackbox optimization; Nonconvex optimization; Hessian matrices; ALGORITHMS; DIRECTIONS;
D O I
10.1007/s11590-023-02033-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Nonconvex minimization algorithms often benefit from the use of second-order information as represented by the Hessian matrix. When the Hessian at a critical point possesses negative eigenvalues, the corresponding eigenvectors can be used to search for further improvement in the objective function value. Computing such eigenpairs can be computationally challenging, particularly if the Hessian matrix itself cannot be built directly but must rather be sampled or approximated. In blackbox optimization, such derivative approximations are built at a significant cost in terms of function values. In this paper, we investigate practical approaches to detect negative eigenvalues in Hessian matrices without accessing the full matrix. We propose a general framework that begins with the diagonal and gradually builds submatrices to detect negative curvature. Crucially, our approach works both when exact Hessian coordinate values are available and when Hessian coordinate values are approximated. We compare several instances of our framework on a test set of Hessian matrices from a popular optimization library, and finite-differences approximations thereof. Our experiments highlight the importance of the variable order in the problem description, and show that forming submatrices is often an efficient approach to detect negative curvature.
引用
收藏
页码:1739 / 1756
页数:18
相关论文
共 21 条
[1]   A subclass of generating set search with convergence to second-order stationary points [J].
Abramson, Mark A. ;
Frimannslund, Lennart ;
Steihaug, Trond .
OPTIMIZATION METHODS & SOFTWARE, 2014, 29 (05) :900-918
[2]  
Audet C., 2017, Springer Series in Operations Research and Financial Engineering
[3]   FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[4]   Complexity bounds for second-order optimality in unconstrained optimization [J].
Cartis, C. ;
Gould, N. I. M. ;
Toint, Ph. L. .
JOURNAL OF COMPLEXITY, 2012, 28 (01) :93-108
[5]  
Conn A., 2000, MPS-SIAM Series on Optimization, V1, DOI DOI 10.1137/1.9780898719857
[6]  
CONN A. R., 2009, Introduction to Derivative-Free Optimization, DOI DOI 10.1137/1.9780898718768
[7]   GLOBAL CONVERGENCE OF GENERAL DERIVATIVE-FREE TRUST-REGION ALGORITHMS TO FIRST- AND SECOND-ORDER CRITICAL POINTS [J].
Conn, Andrew R. ;
Scheinberg, Katya ;
Vicente, Luis N. .
SIAM JOURNAL ON OPTIMIZATION, 2009, 20 (01) :387-415
[8]   Concise complexity analyses for trust region methods [J].
Curtis, Frank E. ;
Lubberts, Zachary ;
Robinson, Daniel P. .
OPTIMIZATION LETTERS, 2018, 12 (08) :1713-1724
[9]  
Dennis J., 1996, Numerical Methods for Unconstrained Optimization and Nonlinear Equations, P155, DOI [10.1137/1.9781611971200, DOI 10.1137/1.9781611971200]
[10]   A nonmonotone truncated Newton-Krylov method exploiting negative curvature directions, for large scale unconstrained optimization [J].
Fasano, Giovanni ;
Lucidi, Stefano .
OPTIMIZATION LETTERS, 2009, 3 (04) :521-535