Privacy Preserving Distributed Bandit Residual Feedback Online Optimization Over Time-Varying Unbalanced Graphs

被引:1
作者
Zhao, Zhongyuan [1 ,2 ]
Yang, Zhiqiang [1 ,2 ]
Jiang, Luyao [3 ]
Yang, Ju [1 ,2 ]
Ge, Quanbo [1 ,2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Automat, Nanjing 210044, Peoples R China
[2] Jiangsu Key Lab Big Data Anal Technol, Nanjing 210044, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Mech Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Location awareness; Privacy; Differential privacy; Accuracy; Federated learning; Heuristic algorithms; Collaboration; Information leakage; Cost function; Protection; distributed online optimization (DOO); federated learning; one-point residual feedback (OPRF); time-varying unbalanced graphs; CONVEX-OPTIMIZATION; CONVERGENCE RATE; ALGORITHM;
D O I
10.1109/JAS.2024.124656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper considers the distributed online optimization (DOO) problem over time-varying unbalanced networks, where gradient information is explicitly unknown. To address this issue, a privacy-preserving distributed online one-point residual feedback (OPRF) optimization algorithm is proposed. This algorithm updates decision variables by leveraging one-point residual feedback to estimate the true gradient information. It can achieve the same performance as the two-point feedback scheme while only requiring a single function value query per iteration. Additionally, it effectively eliminates the effect of time-varying unbalanced graphs by dynamically constructing row stochastic matrices. Furthermore, compared to other distributed optimization algorithms that only consider explicitly unknown cost functions, this paper also addresses the issue of privacy information leakage of nodes. Theoretical analysis demonstrate that the method attains sublinear regret while protecting the privacy information of agents. Finally, numerical experiments on distributed collaborative localization problem and federated learning confirm the effectiveness of the algorithm.
引用
收藏
页码:2284 / 2297
页数:14
相关论文
共 33 条
[1]   Distributed Online Convex Optimization on Time-Varying Directed Graphs [J].
Akbari, Mohammad ;
Gharesifard, Bahman ;
Linder, Tamas .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2017, 4 (03) :417-428
[2]   Privacy-Preserving Distributed Economic Dispatch of Microgrids: A Dynamic Quantization-Based Consensus Scheme With Homomorphic Encryption [J].
Chen, Wei ;
Liu, Lu ;
Liu, Guo-Ping .
IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (01) :701-713
[3]  
Ding T, 2018, IEEE DECIS CONTR P, P3409, DOI 10.1109/CDC.2018.8619119
[4]  
Dwork C., 2006, P AUT LANG PROGR 3 2, P1
[5]  
Flaxman AD, 2005, PROCEEDINGS OF THE SIXTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P385
[6]   Distributionally Robust Stochastic Optimization with Wasserstein Distance [J].
Gao, Rui ;
Kleywegt, Anton .
MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (02) :603-655
[7]   Cloud-Enabled Differentially Private Multiagent Optimization With Constraints [J].
Hale, Matthew T. ;
Egerstedt, Magnus .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2018, 5 (04) :1693-1706
[8]   Federated Learning Meets Multi-Objective Optimization [J].
Hu, Zeou ;
Shaloudegi, Kiarash ;
Zhang, Guojun ;
Yu, Yaoliang .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04) :2039-2051
[9]   DP-ADMM: ADMM-Based Distributed Learning With Differential Privacy [J].
Huang, Zonghao ;
Hu, Rui ;
Guo, Yuanxiong ;
Chan-Tin, Eric ;
Gong, Yanmin .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 :1002-1012
[10]   Distributed Online Bandit Learning in Dynamic Environments Over Unbalanced Digraphs [J].
Li, Jueyou ;
Li, Chaojie ;
Yu, Wenwu ;
Zhu, Xiaomei ;
Yu, Xinghuo .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (04) :3034-3047