Large-scale post-disaster user distributed coverage optimization based on multi-agent reinforcement learning

被引:0
作者
Xu W. [1 ]
Wu S. [1 ]
Wang F. [1 ]
Lin L. [1 ]
Li G. [1 ]
Zhang Z. [3 ]
机构
[1] School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing
[2] Lab of BLOS Trusted Information Transmission, Chongqing University of Posts and Telecommunications, Chongqing
[3] School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing
来源
Tongxin Xuebao/Journal on Communications | 2022年 / 43卷 / 08期
基金
中国国家自然科学基金; 中央高校基本科研业务费专项资金资助; 国家重点研发计划;
关键词
coverage optimization; distributed training; emergency communication; multi-agent reinforcement learning;
D O I
10.11959/j.issn.1000-436x.2022131
中图分类号
学科分类号
摘要
In order to quickly restore emergency communication services for large-scale post-disaster users, a distributed intellicise coverage optimization architecture based on multi-agent reinforcement learning (RL) was proposed, which could address the significant differences and dynamics of communication services caused by a large number of access users, and the difficulty of expansion caused by centralized algorithms. Specifically, a distributed k-sums clustering algorithm considering service differences of users was designed in the network characterization layer, which could make each unmanned aerial vehicle base station (UAV-BS) adjust the local networking natively and simply, and obtain states of cluster center for multi-agent RL. In the trajectory control layer, multi-agent soft actor critic (MASAC) with distributed-training-distributed-execution structure was designed for UAV-BS to control trajectory as intelligent nodes. Furthermore, ensemble learning and curriculum learning were integrated to improve the stability and convergence speed of training process. The simulation results show that the proposed distributed k-sums algorithm is superior to the k-means in terms of average load efficiency and clustering balance, and MASAC based trajectory control algorithm can effectively reduce communication interruptions and improve the spectrum efficiency, which outperforms the existing RL algorithms. © 2022 Editorial Board of Journal on Communications. All rights reserved.
引用
收藏
页码:1 / 16
页数:15
相关论文
共 26 条
  • [1] DEEPAK G C, LADAS A, SAMBO Y A, Et al., An overview of post-disaster emergency communication systems in the future networks, IEEE Wireless Communications, 26, 6, pp. 132-139, (2019)
  • [2] GUO H Z, LI J Y, LIU J J, Et al., A survey on space-air-ground-sea integrated network security in 6G, IEEE Communications Surveys & Tutorials, 24, 1, pp. 53-87, (2022)
  • [3] ZHOU Y Q, LIU L, WANG L, Et al., Service-aware 6G: an intelligent and open network based on the convergence of communication, computing and caching, Digital Communications and Networks, 6, 3, pp. 253-260, (2020)
  • [4] ZHANG P, XU W J, GAO H, Et al., Toward wisdom-evolutionary and primitive-concise 6G: a new paradigm of semantic communication networks, Engineering, 8, pp. 60-73, (2022)
  • [5] ZHANG P, XU X D, HAN S J, Et al., Entropy reduced mobile networks empowering industrial applications, Journal of Beijing University of Posts and Telecommunications, 43, 6, pp. 1-9, (2020)
  • [6] ZHOU Y Q, TIAN L, LIU L, Et al., Fog computing enabled future mobile communication networks: a convergence of communication and computing, IEEE Communications Magazine, 57, 5, pp. 20-27, (2019)
  • [7] KANG Z Y, YOU C S, ZHANG R., 3D placement for multi-UAV relaying: an iterative Gibbs-sampling and block coordinate descent optimization approach, IEEE Transactions on Communications, 69, 3, pp. 2047-2062, (2021)
  • [8] YIN S X, LI L H, YU F R., Resource allocation and basestation placement in downlink cellular networks assisted by multiple wireless powered UAVs, IEEE Transactions on Vehicular Technology, 69, 2, pp. 2171-2184, (2020)
  • [9] ZHANG Y X, CHENG W C., Trajectory and power optimization for multi-UAV enabled emergency wireless communications networks, Proceedings of International Conference on Communications Workshops, pp. 1-6, (2019)
  • [10] LI X, WANG Q, LIU J, Et al., Trajectory design and generalization for UAV enabled networks: a deep reinforcement learning approach, Proceedings of Wireless Communications and Networking Conference, pp. 1-6, (2020)