Three-Stage Stackelberg Game Enabled Clustered Federated Learning in Heterogeneous UAV Swarms

被引:21
作者
He, Wenji [1 ]
Yao, Haipeng [1 ]
Mai, Tianle [1 ]
Wang, Fu [2 ]
Guizani, Mohsen [3 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Elect & Engn, Beijing 100876, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence MB, Machine Learning Dept, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
UAV swarms; clustered federated learning; Stackelberg game; multi-agent reinforcement learning; INTERNET; THINGS;
D O I
10.1109/TVT.2023.3246636
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the past decade, the unmanned aerial vehicles (UAVs) swarm has become a disruptive force reshaping our lives and work. In particular, advances in artificial intelligence have allowed multiple UAVs to coordinate their operations and work together to accomplish various complex tasks, one of which is Federated Learning (FL). As a promising distributed learning paradigm, FL can be adopted well with the limited resources and dynamic network topology of UAV swarms. However, the current FL's training process relies on homogeneous data paradigms, which require distributed UAVs to hold the same structure data. This ideal hypothesis can not apply to the heterogeneous UAV swarms. To tackle this problem, in this paper, we design a clustered federated learning (CFL) architecture, in which we cluster UAV swarms based on the similarities between the participants' optimization directions. Then, we formulate the model trading among model owners, cluster heads, and UAV workers as a three-stage Stackelberg game to optimize the allocation of the limited resources. We design a hierarchical reinforcement learning algorithm to search for the Stackelberg equilibrium under the clustered federated learning system. The performance evaluation demonstrates the uniqueness and stability of the proposed three-stage leader-follower game under the clustered framework, as well as the convergence and effectiveness of the reinforcement learning algorithm.
引用
收藏
页码:9366 / 9380
页数:15
相关论文
共 37 条
  • [1] Abad MSH, 2020, INT CONF ACOUST SPEE, P8866, DOI [10.1109/icassp40776.2020.9054634, 10.1109/ICASSP40776.2020.9054634]
  • [2] Stackelberg versus Cournot equilibrium
    Amir, R
    Grilo, I
    [J]. GAMES AND ECONOMIC BEHAVIOR, 1999, 26 (01) : 1 - 21
  • [3] Hierarchical Reinforcement Learning for Blockchain-Assisted Software Defined Industrial Energy Market
    Cao, Yifan
    Ren, Xiaoxu
    Qiu, Chao
    Wang, Xiaofei
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (09) : 6100 - 6108
  • [4] Wireless Communications for Collaborative Federated Learning
    Chen, Mingzhe
    Poor, H. Vincent
    Saad, Walid
    Cui, Shuguang
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (12) : 48 - 54
  • [5] Cohen G, 2017, IEEE IJCNN, P2921, DOI 10.1109/IJCNN.2017.7966217
  • [6] Dong LX, 2020, INT CONF WIRE COMMUN, P227, DOI 10.1109/WCSP49889.2020.9299689
  • [7] Fudenberg D., 1991, Game Theory
  • [8] Ghosh A, 2019, Arxiv, DOI [arXiv:1906.06629, DOI 10.48550/ARXIV.1906.06629]
  • [9] Arivazhagan MG, 2019, Arxiv, DOI arXiv:1912.00818
  • [10] Edge Intelligence-Driven Joint Offloading and Resource Allocation for Future 6G Industrial Internet of Things
    Gong, Yongkang
    Yao, Haipeng
    Wang, Jingjing
    Li, Maozhen
    Guo, Song
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06): : 5644 - 5655