Distributed adaptive online learning for convex optimization with weight decay

被引:6
作者
Shen, Xiuyu [1 ]
Li, Dequan [1 ]
Fang, Runyue [1 ]
Zhou, Yuejin [1 ]
Wu, Xiongjun [2 ]
机构
[1] Anhui Univ Sci & Technol, Sch Math & Big Data, Huainan 232001, Peoples R China
[2] China Aerosp Sci & Technol Corp, Natl Def Key Lab Sci & Technol Electromagnet Scat, 802 Inst, Shanghai Acad Space Flight Technol,Eighth Acad, Shanghai, Peoples R China
基金
中国博士后科学基金; 国家重点研发计划;
关键词
adaptive gradient; convex optimization; distributed optimization; online learning; weight decay; SUBGRADIENT METHODS; CLASSIFICATION; TRACKING;
D O I
10.1002/asjc.2489
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates an adaptive gradient-based online convex optimization problem over decentralized networks. The nodes of a network aim to track the minimizer of a global time-varying convex function, and the communication pattern among nodes is captured as a connected undirected graph. To tackle such optimization problems in a collaborative and distributed manner, a weight decay distributed adaptive online gradient algorithm, called WDDAOG, is firstly proposed, which incorporates distributed optimization methods with adaptive strategies. Then, our theoretical analysis clearly illustrates the difference between weight decay and L-2 regularization for distributed adaptive gradient algorithms. The dynamic regret bound for the proposed algorithm is further analyzed. It is shown that the dynamic regret bound for convex functions grows with order of O(n(1 + logT) + root nT), where T and n represent the time horizon and the number of nodes associated with the network, respectively. Numerical experiments demonstrate that WDDAOG works well in practice and compares favorably to existing distributed online optimization schemes.
引用
收藏
页码:562 / 575
页数:14
相关论文
共 47 条
[1]   Distributed Online Convex Optimization on Time-Varying Directed Graphs [J].
Akbari, Mohammad ;
Gharesifard, Bahman ;
Linder, Tamas .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2017, 4 (03) :417-428
[2]  
[Anonymous], 2016, Trends Optim., DOI DOI 10.1561/2400000013
[3]  
[Anonymous], 2012, COURSERA NEURAL NETW
[4]   Mirror descent and nonlinear projected subgradient methods for convex optimization [J].
Beck, A ;
Teboulle, M .
OPERATIONS RESEARCH LETTERS, 2003, 31 (03) :167-175
[5]  
Bos S, 1996, IEEE IJCNN, P241, DOI 10.1109/ICNN.1996.548898
[6]  
Boyd S, 2004, SIAM REV, V46, P667, DOI [10.1137/S0036144503423264, 10.1137/s0036144503423264]
[7]   Finite-Time Connectivity-Preserving Consensus of Networked Nonlinear Agents With Unknown Lipschitz Terms [J].
Cao, Yongcan ;
Ren, Wei ;
Casbeer, David W. ;
Schumacher, Corey .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (06) :1700-1705
[8]   Centralized and decentralized distributed control of longitudinal vehicular platoons with non-uniform communication topology [J].
Chehardoli, Hossein ;
Ghasemi, Ali ;
Najafi, Ali .
ASIAN JOURNAL OF CONTROL, 2019, 21 (06) :2691-2699
[9]   Distributed Average Tracking of Multiple Time-Varying Reference Signals With Bounded Derivatives [J].
Chen, Fei ;
Cao, Yongcan ;
Ren, Wei .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (12) :3169-3174
[10]  
Colin I, 2016, PR MACH LEARN RES, V48