Distributed Online Convex Optimization With an Aggregative Variable

被引:54
作者
Li, Xiuxian [1 ,2 ]
Yi, Xinlei [3 ]
Xie, Lihua [4 ]
机构
[1] Inst Adv Study, Dept Control Sci & Engn, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[2] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 201804, Peoples R China
[3] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, Div Decis & Control Syst, S-10044 Stockholm, Sweden
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
来源
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS | 2022年 / 9卷 / 01期
基金
中国国家自然科学基金;
关键词
Aggregative variable; distributed algorithms; dynamic regret; multiagent networks; online convex optimization; ALGORITHM; REGRET;
D O I
10.1109/TCNS.2021.3107480
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article investigates distributed online convex optimization in the presence of an aggregative variable without any global/central coordinators over a multiagent network. In this problem, each individual agent is only able to access partial information of time-varying global loss functions, thus requiring local information exchanges between neighboring agents. Motivated by many applications in reality, the considered local loss functions depend not only on their own decision variables, but also on an aggregative variable, such as the average of all decision variables. To handle this problem, an online distributed gradient tracking algorithm (O-DGT) is proposed with exact gradient information and it is shown that the dynamic regret is upper bounded by three terms: 1) a sublinear term; 2) a path variation term; and 3) a gradient variation term. Meanwhile, the O-DGT algorithm is also analyzed with stochastic/noisy gradients, showing that the expected dynamic regret has the same upper bound as the exact gradient case. To our best knowledge, this article is the first to study online convex optimization in the presence of an aggregative variable, which enjoys new characteristics in comparison with the conventional scenario without the aggregative variable. Finally, a numerical experiment is provided to corroborate the obtained theoretical results.
引用
收藏
页码:438 / 449
页数:12
相关论文
共 41 条
[1]   Distributed Online Convex Optimization on Time-Varying Directed Graphs [J].
Akbari, Mohammad ;
Gharesifard, Bahman ;
Linder, Tamas .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2017, 4 (03) :417-428
[2]  
Bastianello N., 2020, ARXIV200100870
[3]   Online Convex Optimization With Time-Varying Constraints and Bandit Feedback [J].
Cao, Xuanyu ;
Liu, K. J. Ray .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (07) :2665-2680
[4]  
Chen L, 2018, PR MACH LEARN RES, V80
[5]   An Online Convex Optimization Approach to Proactive Network Resource Allocation [J].
Chen, Tianyi ;
Ling, Qing ;
Giannakis, Georgios B. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (24) :6350-6364
[6]   Online Learning Over Dynamic Graphs via Distributed Proximal Gradient Algorithm [J].
Dixit, Rishabh ;
Bedi, Amrit Singh ;
Rajawat, Ketan .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (11) :5065-5079
[7]  
Garg V., 2017, ADV NEURAL INFORM PR, P5341
[8]   Logarithmic regret algorithms for online convex optimization [J].
Hazan, Elad ;
Agarwal, Amit ;
Kale, Satyen .
MACHINE LEARNING, 2007, 69 (2-3) :169-192
[9]   Online Distributed Convex Optimization on Dynamic Networks [J].
Hosseini, Saghar ;
Chapman, Airlie ;
Mesbahi, Mehran .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2016, 61 (11) :3545-3550
[10]  
Jadbabaie A, 2015, JMLR WORKSH CONF PRO, V38, P398