First-Order Methods for Convex Optimization

Cited by: 15
Authors
Dvurechensky, Pavel [1 ,2 ,3 ]
Shtern, Shimrit [4 ]
Staudigl, Mathias [5 ,6 ]
Affiliations
[1] Weierstrass Inst Appl Anal & Stochast, Mohrenstr 39, D-10117 Berlin, Germany
[2] Inst Informat Transmiss Problems RAS, Bolshoy Karetny Per 19,Build 1, Moscow 127051, Russia
[3] Moscow Inst Phys & Technol, 9 Inst Skiy Per, Dolgoprudnyi 141701, Moscow Region, Russia
[4] Technion Israel Inst Technol, Fac Ind Engn & Management, Haifa, Israel
[5] Maastricht Univ, Dept Data Sci & Knowledge Engn DKE, Paul Henri Spaaklaan 1, NL-6229 EN Maastricht, Netherlands
[6] Maastricht Univ, Math Ctr Maastricht MCM, Paul Henri Spaaklaan 1, NL-6229 EN Maastricht, Netherlands
Keywords
Convex Optimization; Composite Optimization; First-Order Methods; Numerical Algorithms; Convergence Rate; Proximal Mapping; Proximity Operator; Bregman Divergence; Stochastic Composite Optimization; Projected Subgradient Methods; Intermediate Gradient Method; Coordinate Descent Methods; Variational Inequalities; Mirror Descent; Frank-Wolfe; Approximation Algorithms; Thresholding Algorithm; Minimization Algorithm
DOI
10.1016/j.ejco.2021.100015
Chinese Library Classification
C93 [Management]; O22 [Operations Research]
Subject Classification Codes
070105; 12; 1201; 1202; 120202
Abstract
First-order methods for solving convex optimization problems have been at the forefront of mathematical optimization for the last 20 years. The rapid development of this important class of algorithms is motivated by success stories reported in various applications, most importantly machine learning, signal processing, imaging, and control theory. First-order methods have the potential to provide low-accuracy solutions at low computational cost, which makes them an attractive set of tools for large-scale optimization problems. In this survey, we cover a number of key developments in gradient-based optimization methods, including non-Euclidean extensions of the classical proximal gradient method and its accelerated versions. Additionally, we survey recent developments within the class of projection-free methods and proximal versions of primal-dual schemes. We give complete proofs of several key results and highlight the unifying aspects of several optimization algorithms.
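The abstract mentions the classical proximal gradient method for composite problems. As a minimal, generic illustration (not taken from the survey itself), the sketch below applies the proximal gradient iteration to a lasso-type problem min_x 0.5*||Ax - b||^2 + lam*||x||_1; the function names, problem sizes, regularization weight, and step-size choice are illustrative assumptions, not the authors' implementation.

import numpy as np

def soft_threshold(v, tau):
    # Proximity operator of tau * ||.||_1 (soft-thresholding).
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def proximal_gradient(A, b, lam, step, iters=500):
    # Proximal gradient method (ISTA) for min_x 0.5*||Ax - b||^2 + lam*||x||_1.
    # 'step' should be at most 1 / L, where L = ||A||_2^2 is the Lipschitz
    # constant of the gradient of the smooth part.
    x = np.zeros(A.shape[1])
    for _ in range(iters):
        grad = A.T @ (A @ x - b)                           # gradient of the smooth part
        x = soft_threshold(x - step * grad, step * lam)    # proximal (prox) step
    return x

if __name__ == "__main__":
    # Illustrative random sparse-recovery instance.
    rng = np.random.default_rng(0)
    A = rng.standard_normal((40, 100))
    x_true = np.zeros(100)
    x_true[:5] = 1.0
    b = A @ x_true + 0.01 * rng.standard_normal(40)
    L = np.linalg.norm(A, 2) ** 2                          # spectral norm squared
    x_hat = proximal_gradient(A, b, lam=0.1, step=1.0 / L)
    print("recovered support:", np.nonzero(np.abs(x_hat) > 1e-3)[0])

Accelerated variants of this scheme (e.g., FISTA-type momentum) and its non-Euclidean (Bregman) extensions, which the survey covers, modify the update above but keep the same gradient-plus-prox structure.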
Pages: 27