Fastest rates for stochastic mirror descent methods

被引:10
作者
Hanzely, Filip [1 ]
Richtarik, Peter [1 ,2 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Thuwal, Saudi Arabia
[2] Moscow Inst Phys & Technol MIPT, Dolgoprudnyi, Russia
关键词
Gradient descent; Relative smoothness; Coordinate descent; Stochastic gradient descent; COORDINATE DESCENT;
D O I
10.1007/s10589-021-00284-5
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Relative smoothness-a notion introduced in Birnbaum et al. (Proceedings of the 12th ACM conference on electronic commerce, ACM, pp 127-136, 2011) and recently rediscovered in Bauschke et al. (Math Oper Res 330-348, 2016) and Lu et al. (Relatively-smooth convex optimization by first-order methods, and applications, , 2016)-generalizes the standard notion of smoothness typically used in the analysis of gradient type methods. In this work we are taking ideas from well studied field of stochastic convex optimization and using them in order to obtain faster algorithms for minimizing relatively smooth functions. We propose and analyze two new algorithms: Relative Randomized Coordinate Descent (relRCD) and Relative Stochastic Gradient Descent (relSGD), both generalizing famous algorithms in the standard smooth setting. The methods we propose can be in fact seen as particular instances of stochastic mirror descent algorithms, which has been usually analyzed under stronger assumptions: Lipschitzness of the objective and strong convexity of the reference function. As a consequence, one of the proposed methods, relRCD corresponds to the first stochastic variant of mirror descent algorithm with linear convergence rate.
引用
收藏
页码:717 / 766
页数:50
相关论文
共 42 条
  • [1] Afkanpour A., 2013, P 30 INT C MACH LEAR, V28, P374
  • [2] Allen- Zhu Z., 2014, ARXIV14071537
  • [3] [Anonymous], 2012, Advances in Neural Information Processing Systems
  • [4] A Descent Lemma Beyond Lipschitz Gradient Continuity: First-Order Methods Revisited and Applications
    Bauschke, Heinz H.
    Bolte, Jerome
    Teboulle, Marc
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2017, 42 (02) : 330 - 348
  • [5] Mirror descent and nonlinear projected subgradient methods for convex optimization
    Beck, A
    Teboulle, M
    [J]. OPERATIONS RESEARCH LETTERS, 2003, 31 (03) : 167 - 175
  • [6] Benning Martin, 2016, ARXIV161202506
  • [7] Image deblurring with Poisson data: from cells to galaxies
    Bertero, M.
    Boccacci, P.
    Desidera, G.
    Vicidomini, G.
    [J]. INVERSE PROBLEMS, 2009, 25 (12)
  • [8] Birnbaum B. E., 2011, P 12 ACM C EL COMM, P127
  • [9] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)