DECENTRALIZED STOCHASTIC NON-CONVEX OPTIMIZATION OVER WEAKLY CONNECTED TIME-VARYING DIGRAPHS

被引:0
作者
Lu, Songtao [1 ]
Wu, Chai Wah [1 ]
机构
[1] IBM Res AI, IBM Thomas J Waston Res Ctr, Yorktown Hts, NY 10598 USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
Weakly connected digraphs; time-varying; non-convex optimization; decentralized; gradient tracking; push sum; DISTRIBUTED OPTIMIZATION; CONVERGENCE; ALGORITHM; CONSENSUS;
D O I
10.1109/icassp40776.2020.9053238
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we consider decentralized stochastic non-convex optimization over a class of weakly connected digraphs. First, we quantify the convergence behaviors of the weight matrices of this type of digraphs. By leveraging the perturbed push sum protocol and gradient tracking techniques, we propose a decentralized stochastic algorithm that is able to converge to the first-order stationary points of non-convex problems with provable convergence rates. Our digraph structure considered in this work generalizes the existing settings such that the proposed algorithm can be applied to more practical decentralized learning scenarios. Numerical results showcase the strengths of our theory and superiority of the proposed algorithm in decentralized training problems compared with the existing counterparts.
引用
收藏
页码:5770 / 5774
页数:5
相关论文
共 29 条
  • [1] Assran M, 2019, PR MACH LEARN RES, V97
  • [2] Convergence of a Multi-Agent Projected Stochastic Gradient Algorithm for Non-Convex Optimization
    Bianchi, Pascal
    Jakubowicz, Jeremie
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (02) : 391 - 405
  • [3] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [4] Conditions for weak ergodicity of inhomogeneous Markov chains
    Coppersmith, Don
    Wu, Chai Wah
    [J]. STATISTICS & PROBABILITY LETTERS, 2008, 78 (17) : 3082 - 3085
  • [5] NEXT: In-Network Nonconvex Optimization
    Di Lorenzo, Paolo
    Scutari, Gesualdo
    [J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2016, 2 (02): : 120 - 136
  • [6] CONVERGENCE ANALYSIS OF ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR A FAMILY OF NONCONVEX PROBLEMS
    Hong, Mingyi
    Luo, Zhi-Quan
    Razaviyayn, Meisam
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2016, 26 (01) : 337 - 364
  • [7] Linear Convergence Rate of a Class of Distributed Augmented Lagrangian Algorithms
    Jakovetic, Dusan
    Moura, Jose M. F.
    Xavier, Joao
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (04) : 922 - 936
  • [8] Lian X., 2017, Advances in neural information processing systems (NeurIPS 2017), V30, P5330
  • [9] Diffusion adaptive networks with changing topologies
    Lopes, Cassio G.
    Sayed, Ali H.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3285 - 3288
  • [10] Lu S., 2019, NEURIPS WORKSH FED L