Stabilized SQP revisited

被引:0
作者
A. F. Izmailov
M. V. Solodov
机构
[1] Moscow State University,Department of Operations Research, Faculty of Computational Mathematics and Cybernetics
[2] IMPA,undefined
[3] Instituto de Matemática Pura e Aplicada,undefined
来源
Mathematical Programming | 2012年 / 133卷
关键词
Constrained optimization; Degenerate constraints; Second-order sufficiency; Critical multipliers; Error bound; Stabilized SQP; Superlinear convergence; Local regularization; 90C30; 90C33; 90C55; 65K05;
D O I
暂无
中图分类号
学科分类号
摘要
The stabilized version of the sequential quadratic programming algorithm (sSQP) had been developed in order to achieve superlinear convergence in situations when the Lagrange multipliers associated to a solution are not unique. Within the framework of Fischer (Math Program 94:91–124, 2002), the key to local superlinear convergence of sSQP are the following two properties: upper Lipschitzian behavior of solutions of the Karush-Kuhn-Tucker (KKT) system under canonical perturbations and local solvability of sSQP subproblems with the associated primal-dual step being of the order of the distance from the current iterate to the solution set of the unperturbed KKT system. According to Fernández and Solodov (Math Program 125:47–73, 2010), both of these properties are ensured by the second-order sufficient optimality condition (SOSC) without any constraint qualification assumptions. In this paper, we state precise relationships between the upper Lipschitzian property of solutions of KKT systems, error bounds for KKT systems, the notion of critical Lagrange multipliers (a subclass of multipliers that violate SOSC in a very special way), the second-order necessary condition for optimality, and solvability of sSQP subproblems. Moreover, for the problem with equality constraints only, we prove superlinear convergence of sSQP under the assumption that the dual starting point is close to a noncritical multiplier. Since noncritical multipliers include all those satisfying SOSC but are not limited to them, we believe this gives the first superlinear convergence result for any Newtonian method for constrained optimization under assumptions that do not include any constraint qualifications and are weaker than SOSC. In the general case when inequality constraints are present, we show that such a relaxation of assumptions is not possible. We also consider applying sSQP to the problem where inequality constraints are reformulated into equalities using slack variables, and discuss the assumptions needed for convergence in this approach. We conclude with consequences for local regularization methods proposed in (Izmailov and Solodov SIAM J Optim 16:210–228, 2004; Wright SIAM J. Optim. 15:673–676, 2005). In particular, we show that these methods are still locally superlinearly convergent under the noncritical multiplier assumption, weaker than SOSC employed originally.
引用
收藏
页码:93 / 120
页数:27
相关论文
共 31 条
  • [1] Boggs B.T.(1996)Sequential quadratic programming Acta Numerica 4 1-51
  • [2] Tolle J.W.(1994)Local analysis of Newton-type methods for variational inequalities and nonlinear programming Appl. Math. Optim. 29 161-186
  • [3] Bonnans J.F.(2004)A class of active-set Newton methods for mixed complementarity problems SIAM J. Optim. 15 409-429
  • [4] Daryina A.N.(1952)Definite and semidefinite quadratic forms Econometrica 20 295-300
  • [5] Izmailov A.F.(1999)On the accurate identification of active constraints SIAM J. Optim. 9 14-32
  • [6] Solodov M.V.(2010)Stabilized sequential quadratic programming for optimization and a stabilized Newton-type method for variational problems Math. Program. 125 47-73
  • [7] Debreu G.(1937)Über das vorkommen definiter und semidefiniter formen und scharen quadratischer formen Commentarii Matematici Helvetica 94 188-192
  • [8] Facchinei F.(2002)Local behavior of an iterative framework for generalized equations with nonisolated solutions Math. Program. 94 91-124
  • [9] Fischer A.(1999)Stabilized sequential quadratic programming Comput. Optim. Appl. 12 253-273
  • [10] Kanzow C.(1999)Stability in the presence of degeneracy and error estimation Math. Program. 85 181-192