INCREMENTAL MAJORIZATION-MINIMIZATION OPTIMIZATION WITH APPLICATION TO LARGE-SCALE MACHINE LEARNING

被引:161
作者
Mairal, Julien [1 ]
机构
[1] Univ Grenoble Alpes, CNRS, Lab Jean Kuntzmann, Inria,LEAR Team, F-38330 Montbonnot St Martin, France
关键词
nonconvex optimization; convex optimization; majorization-minimization; THRESHOLDING ALGORITHM; ONLINE; SPARSITY; CONVERGENCE;
D O I
10.1137/140957639
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Majorization-minimization algorithms consist of successively minimizing a sequence of upper bounds of the objective function. These upper bounds are tight at the current estimate, and each iteration monotonically drives the objective function downhill. Such a simple principle is widely applicable and has been very popular in various scientific fields, especially in signal processing and statistics. We propose an incremental majorization-minimization scheme for minimizing a large sum of continuous functions, a problem of utmost importance in machine learning. We present convergence guarantees for nonconvex and convex optimization when the upper bounds approximate the objective up to a smooth error; we call such upper bounds "first-order surrogate functions." More precisely, we study asymptotic stationary point guarantees for nonconvex problems, and for convex ones, we provide convergence rates for the expected objective function value. We apply our scheme to composite optimization and obtain a new incremental proximal gradient algorithm with linear convergence rate for strongly convex functions. Our experiments show that our method is competitive with the state of the art for solving machine learning problems such as logistic regression when the number of training samples is large enough, and we demonstrate its usefulness for sparse estimation with nonconvex penalties.
引用
收藏
页码:829 / 855
页数:27
相关论文
共 57 条
  • [1] Convergent incremental optimization transfer algorithms: Application to tomography
    Ahn, S
    Fessler, JA
    Blatt, D
    Hero, AO
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2006, 25 (03) : 283 - 296
  • [2] [Anonymous], P ICML
  • [3] [Anonymous], 2010, FIXED POINT ALGORITH
  • [4] [Anonymous], 1999, Nonlinear Programming
  • [5] [Anonymous], ARXIV13074457V2
  • [6] [Anonymous], ARXIV12112717
  • [7] [Anonymous], 2013, Introductory lectures on convex optimization: A basic course
  • [8] [Anonymous], CMUCS01109
  • [9] [Anonymous], P ADV NEUR INF PROC
  • [10] [Anonymous], P ICML