Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization

Cited by: 13
Authors
Yi, Xinlei [1, 2]
Li, Xiuxian [3, 4]
Yang, Tao [5 ]
Xie, Lihua [6 ]
Chai, Tianyou [5 ]
Johansson, Karl Henrik [1, 2]
Affiliations
[1] KTH Royal Inst Technol, Sch Elect Engn & Comp Sci, Div Decis & Control Syst, S-10044 Stockholm, Sweden
[2] Digital Futures, S-10044 Stockholm, Sweden
[3] Tongji Univ, Coll Elect & Informat Engn, Dept Control Sci & Engn, Shanghai 200070, Peoples R China
[4] Tongji Univ, Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 200070, Peoples R China
[5] Northeastern Univ, State Key Lab Synthet Automation Proc Ind, Shenyang 110819, Peoples R China
[6] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
Funding
National Natural Science Foundation of China; Swedish Research Council
Keywords
Convex functions; Measurement; Heuristic algorithms; Benchmark testing; Time measurement; Standards; Machine learning; Cumulative constraint violation; distributed optimization; online optimization; regret; time-varying constraints; ALGORITHM;
DOI
10.1109/TAC.2022.3230766
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
This article considers the distributed online convex optimization problem with time-varying constraints over a network of agents. This is a sequential decision-making problem with two sequences of arbitrarily varying convex loss and constraint functions. At each round, each agent selects a decision from the decision set, and then only a portion of the loss function and a coordinate block of the constraint function at this round are privately revealed to this agent. The goal of the network is to minimize the network-wide loss accumulated over time. Two distributed online algorithms with full-information and bandit feedback are proposed. Both dynamic and static network regret bounds are analyzed for the proposed algorithms, and network cumulative constraint violation is used to measure constraint violation, which excludes the situation that strictly feasible constraints can compensate for the effects of violated constraints. In particular, we show that the proposed algorithms achieve $\mathcal{O}(T^{\max\{\kappa, 1-\kappa\}})$ static network regret and $\mathcal{O}(T^{1-\kappa/2})$ network cumulative constraint violation, where $T$ is the time horizon and $\kappa \in (0, 1)$ is a user-defined tradeoff parameter. Moreover, if the loss functions are strongly convex, then the static network regret bound can be reduced to $\mathcal{O}(T^{\kappa})$. Finally, numerical simulations are provided to illustrate the effectiveness of the theoretical results.
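The difference between cumulative constraint violation and the more conventional summed violation can be illustrated with a minimal sketch. The function names, the sample constraint values, and the single-agent setting below are illustrative assumptions, not taken from the article; the sketch only shows why clipping each round at zero before summing keeps strictly feasible rounds from offsetting violated ones, and how the stated bound exponents depend on the tradeoff parameter (written `kappa` here).

```python
import math


def positive_part(vec):
    """Clip every coordinate at zero, so only violations (g > 0) remain."""
    return [max(v, 0.0) for v in vec]


def norm(vec):
    """Euclidean norm of a list of floats."""
    return math.sqrt(sum(v * v for v in vec))


def constraint_violation(g_values):
    """Conventional metric: sum over rounds first, then clip.

    Feasible rounds (negative entries) can cancel violated rounds.
    """
    summed = [sum(col) for col in zip(*g_values)]
    return norm(positive_part(summed))


def cumulative_constraint_violation(g_values):
    """Cumulative metric: clip each round first, then sum the norms.

    Feasible rounds contribute zero and cannot offset violations.
    """
    return sum(norm(positive_part(g)) for g in g_values)


def bound_exponents(kappa):
    """Exponents of T in the bounds quoted in the abstract.

    Returns (static regret exponent, cumulative violation exponent).
    """
    return max(kappa, 1.0 - kappa), 1.0 - kappa / 2.0


# Hypothetical constraint values g_t(x_t) over four rounds (one scalar constraint).
g = [[1.0], [-1.0], [2.0], [-2.0]]
cv = constraint_violation(g)              # violations are cancelled out
ccv = cumulative_constraint_violation(g)  # violations are all counted
regret_exp, violation_exp = bound_exponents(0.5)
```

In this toy sequence the conventional metric reports no violation at all, while the cumulative metric counts both violated rounds, which is the situation the abstract says the cumulative metric is designed to capture.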
Pages: 2875-2890
Page count: 16