Multi-block Nonconvex Nonsmooth Proximal ADMM: Convergence and Rates Under Kurdyka-Lojasiewicz Property

被引:14
作者
Yashtini, Maryam [1 ]
机构
[1] Georgetown Univ, Dept Math & Stat, 327A St Marys Hall 37th & O St NW, Washington, DC 20057 USA
关键词
Nonconvex nonsmooth optimization; Proximal ADMM; Kurdyka-Lojasiewicz property; Convergence; Convergence rates; ALTERNATING DIRECTION METHOD; VARIABLE SELECTION; MINIMIZATION; OPTIMIZATION; ALGORITHMS; REGULARIZATION; DECOMPOSITION; MULTIPLIERS; STEPSIZE;
D O I
10.1007/s10957-021-01919-7
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We study the convergence and convergence rates of a multi-block proximal alternating direction method of multipliers (PADMM) for solving linearly constrained separable nonconvex nonsmooth optimization problems. This algorithm is an important variant of the alternating direction method of multipliers (ADMM) which includes a proximal term in each subproblem, to cancel out complicated terms in applications where subproblems are not easy to solve or do not admit a simple closed form solution. We consider an over-relaxation step size in the dual update and provide a detailed proof of the convergence for any step size beta is an element of(0,2). We prove the convergence of the sequence generated by the PADMM by showing that it has a finite length and it is Cauchy. Under the powerful Kurdyka-Lojasiewicz (KL) property, we establish the convergence rates for the values and the iterates, and we show that various values of KL-exponent associated with the objective function can raise PADMM with three different convergence rates. More precisely, we show that if the (KL) exponent theta=0, the sequence generated by PADMM converges in a finite numbers of iterations. If theta is an element of(0,1/2], then the sequential rate of convergence is cQk where c>0, Q is an element of(0,1), and k is an element of N is the iteration number. If theta is an element of(1/2,1], then O(1/kr) rate where r=(1-theta)/(2 theta-1) is achieved.
引用
收藏
页码:966 / 998
页数:33
相关论文
共 70 条
[1]  
[Anonymous], 1969, Optimization
[2]   On the convergence of the proximal algorithm for nonsmooth functions involving analytic features [J].
Attouch, Hedy ;
Bolte, Jerome .
MATHEMATICAL PROGRAMMING, 2009, 116 (1-2) :5-16
[3]   Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods [J].
Attouch, Hedy ;
Bolte, Jerome ;
Svaiter, Benar Fux .
MATHEMATICAL PROGRAMMING, 2013, 137 (1-2) :91-129
[4]   Proximal Alternating Minimization and Projection Methods for Nonconvex Problems: An Approach Based on the Kurdyka-Lojasiewicz Inequality [J].
Attouch, Hedy ;
Bolte, Jerome ;
Redont, Patrick ;
Soubeyran, Antoine .
MATHEMATICS OF OPERATIONS RESEARCH, 2010, 35 (02) :438-457
[5]   Optimization with Sparsity-Inducing Penalties [J].
Bach, Francis ;
Jenatton, Rodolphe ;
Mairal, Julien ;
Obozinski, Guillaume .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (01) :1-106
[6]  
Bertsekas D., 1984, NUMERICAL METHODS NO
[7]   THE LOJASIEWICZ INEQUALITY FOR NONSMOOTH SUBANALYTIC FUNCTIONS WITH APPLICATIONS TO SUBGRADIENT DYNAMICAL SYSTEMS [J].
Bolte, Jerome ;
Daniilidis, Aris ;
Lewis, Adrian .
SIAM JOURNAL ON OPTIMIZATION, 2007, 17 (04) :1205-1223
[8]   Clarke subgradients of stratifiable functions [J].
Bolte, Jerome ;
Daniilidis, Aris ;
Lewis, Adrian ;
Shiota, Masahiro .
SIAM JOURNAL ON OPTIMIZATION, 2007, 18 (02) :556-572
[9]   Proximal alternating linearized minimization for nonconvex and nonsmooth problems [J].
Bolte, Jerome ;
Sabach, Shoham ;
Teboulle, Marc .
MATHEMATICAL PROGRAMMING, 2014, 146 (1-2) :459-494
[10]   CHARACTERIZATIONS OF LOJASIEWICZ INEQUALITIES: SUBGRADIENT FLOWS, TALWEG, CONVEXITY [J].
Bolte, Jerome ;
Daniilidis, Aris ;
Ley, Olivier ;
Mazet, Laurent .
TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 2010, 362 (06) :3319-3363