DECENTRALIZED STOCHASTIC NON-CONVEX OPTIMIZATION OVER WEAKLY CONNECTED TIME-VARYING DIGRAPHS

被引：0

作者：

Lu, Songtao ^{[1
]}

Wu, Chai Wah ^{[1
]}

机构：

[1] IBM Res AI, IBM Thomas J Waston Res Ctr, Yorktown Hts, NY 10598 USA

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

关键词：

Weakly connected digraphs; time-varying; non-convex optimization; decentralized; gradient tracking; push sum; DISTRIBUTED OPTIMIZATION; CONVERGENCE; ALGORITHM; CONSENSUS;

D O I：

10.1109/icassp40776.2020.9053238

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we consider decentralized stochastic non-convex optimization over a class of weakly connected digraphs. First, we quantify the convergence behaviors of the weight matrices of this type of digraphs. By leveraging the perturbed push sum protocol and gradient tracking techniques, we propose a decentralized stochastic algorithm that is able to converge to the first-order stationary points of non-convex problems with provable convergence rates. Our digraph structure considered in this work generalizes the existing settings such that the proposed algorithm can be applied to more practical decentralized learning scenarios. Numerical results showcase the strengths of our theory and superiority of the proposed algorithm in decentralized training problems compared with the existing counterparts.

引用

页码：5770 / 5774

页数：5

共 29 条

[1] Assran M, 2019, PR MACH LEARN RES, V97
[2] Convergence of a Multi-Agent Projected Stochastic Gradient Algorithm for Non-Convex Optimization
Bianchi, Pascal
Jakubowicz, Jeremie
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2013, 58 (02) : 391 - 405
[3] Distributed optimization and statistical learning via the alternating direction method of multipliers
Boyd S.
Parikh N.
Chu E.
Peleato B.
Eckstein J.
[J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
[4] Conditions for weak ergodicity of inhomogeneous Markov chains
Coppersmith, Don
Wu, Chai Wah
[J]. STATISTICS & PROBABILITY LETTERS, 2008, 78 (17) : 3082 - 3085
[5] NEXT: In-Network Nonconvex Optimization
Di Lorenzo, Paolo
Scutari, Gesualdo
[J]. IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2016, 2 (02): : 120 - 136
[6] CONVERGENCE ANALYSIS OF ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR A FAMILY OF NONCONVEX PROBLEMS
Hong, Mingyi
Luo, Zhi-Quan
Razaviyayn, Meisam
[J]. SIAM JOURNAL ON OPTIMIZATION, 2016, 26 (01) : 337 - 364
[7] Linear Convergence Rate of a Class of Distributed Augmented Lagrangian Algorithms
Jakovetic, Dusan
Moura, Jose M. F.
Xavier, Joao
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (04) : 922 - 936
[8] Lian X., 2017, Advances in neural information processing systems (NeurIPS 2017), V30, P5330
[9] Diffusion adaptive networks with changing topologies
Lopes, Cassio G.
Sayed, Ali H.
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3285 - 3288
[10] Lu S., 2019, NEURIPS WORKSH FED L

← 1 2 3 →