Three-Stage Stackelberg Game Enabled Clustered Federated Learning in Heterogeneous UAV Swarms

被引：32

作者：

He, Wenji ^{[1
]}

Yao, Haipeng ^{[1
]}

Mai, Tianle ^{[1
]}

Wang, Fu ^{[2
]}

Guizani, Mohsen ^{[3
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

[2] Beijing Univ Posts & Telecommun, Sch Elect & Engn, Beijing 100876, Peoples R China

[3] Mohamed Bin Zayed Univ Artificial Intelligence MB, Machine Learning Dept, Abu Dhabi, U Arab Emirates

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2023年 / 72卷 / 07期

基金：

中国国家自然科学基金;

关键词：

UAV swarms; clustered federated learning; Stackelberg game; multi-agent reinforcement learning; INTERNET; THINGS;

D O I：

10.1109/TVT.2023.3246636

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In the past decade, the unmanned aerial vehicles (UAVs) swarm has become a disruptive force reshaping our lives and work. In particular, advances in artificial intelligence have allowed multiple UAVs to coordinate their operations and work together to accomplish various complex tasks, one of which is Federated Learning (FL). As a promising distributed learning paradigm, FL can be adopted well with the limited resources and dynamic network topology of UAV swarms. However, the current FL's training process relies on homogeneous data paradigms, which require distributed UAVs to hold the same structure data. This ideal hypothesis can not apply to the heterogeneous UAV swarms. To tackle this problem, in this paper, we design a clustered federated learning (CFL) architecture, in which we cluster UAV swarms based on the similarities between the participants' optimization directions. Then, we formulate the model trading among model owners, cluster heads, and UAV workers as a three-stage Stackelberg game to optimize the allocation of the limited resources. We design a hierarchical reinforcement learning algorithm to search for the Stackelberg equilibrium under the clustered federated learning system. The performance evaluation demonstrates the uniqueness and stability of the proposed three-stage leader-follower game under the clustered framework, as well as the convergence and effectiveness of the reinforcement learning algorithm.

引用

页码：9366 / 9380

页数：15

共 37 条

[1]

Abad MSH, 2020, INT CONF ACOUST SPEE, P8866, DOI [10.1109/icassp40776.2020.9054634, 10.1109/ICASSP40776.2020.9054634]

[2] Stackelberg versus Cournot equilibrium [J].

Amir, R ;

Grilo, I .

GAMES AND ECONOMIC BEHAVIOR, 1999, 26 (01) :1-21

[3]

[Anonymous], 1991, Game Theory

[4] Hierarchical Reinforcement Learning for Blockchain-Assisted Software Defined Industrial Energy Market [J].

Cao, Yifan ;

Ren, Xiaoxu ;

Qiu, Chao ;

Wang, Xiaofei .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) :6100-6108

[5] Wireless Communications for Collaborative Federated Learning [J].

Chen, Mingzhe ;

Poor, H. Vincent ;

Saad, Walid ;

Cui, Shuguang .

IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (12) :48-54

[6]

Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217

[7]

Dong LX, 2020, INT CONF WIRE COMMUN, P227, DOI 10.1109/WCSP49889.2020.9299689

[8]

Ghosh A, 2019, Arxiv, DOI arXiv:1906.06629

[9]

Arivazhagan MG, 2019, Arxiv, DOI arXiv:1912.00818

[10] Edge Intelligence-Driven Joint Offloading and Resource Allocation for Future 6G Industrial Internet of Things [J].

Gong, Yongkang ;

Yao, Haipeng ;

Wang, Jingjing ;

Li, Maozhen ;

Guo, Song .

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06) :5644-5655

← 1 2 3 4 →