Energy-Efficient Multi-Agent Reinforcement Learning for UAV Trajectory Optimization in Cell-Free Massive MIMO Networks

被引:1
作者
Liu, Zhilong [1 ,2 ]
Zhang, Jiayi [1 ,2 ]
Zeng, Yong [3 ,4 ]
Ai, Bo [1 ,2 ]
机构
[1] State Key Lab Adv Rail Autonomous Operat, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
[3] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
[4] Purple Mt Labs, Nanjing 211111, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous aerial vehicles; Heuristic algorithms; Space-air-ground integrated networks; Computer architecture; Training; Power demand; Uplink; Satellites; Artificial intelligence; Trajectory optimization; Cell-free massive MIMO; multi-agent reinforcement learning; trajectory optimization; uncrewed aerial vehicle; REQUIREMENTS; ARCHITECTURE;
D O I
10.1109/TWC.2025.3550266
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To enhance global data transmission, uncrewed aerial vehicle (UAV)-aided space-air-ground integrated networks (SAGIN) represent a pivotal direction for future advancements. In this paper, we focus on the trajectory optimization problem with the goal of maximizing the energy efficiency (EE), thereby balancing the system capacity with energy expenditure. To this end, we first introduce a cell-free SAGIN network where UAVs function as flying access points to serve ground user equipment (GUE). Given that the transmission power of satellite direct-to-cell devices typically exceeds that of GUEs, we investigate the interference effect and derive exact closed-form expressions for the uplink spectral efficiency. In order to improve the service access efficiency, a GUE grouping scheme based on density distribution is proposed. Then, an effective EE analysis model is established considering the power consumption of fixed-wing UAVs. To solve the UAV trajectory optimization problem, two algorithms over two timescales are proposed: a successive convex approximation strategy and a multi-agent reinforcement learning (MARL)-based algorithm. In particular, to reduce the algorithmic complexity, we employ a shared Critic network in the proposed MARL algorithm to reduce the training parameters. Importantly, our approach comprehensively optimizes the UAV trajectory, acceleration, and velocity parameters. The results show that the proposed GUE grouping algorithm and the MARL-based optimization algorithm demonstrate adaptability in dynamic time-varying environments.
引用
收藏
页码:5917 / 5930
页数:14
相关论文
共 42 条
[1]   Future Ultra-Dense LEO Satellite Networks: A Cell-Free Massive MIMO Approach [J].
Abdelsadek, Mohammed Y. ;
Yanikomeroglu, Halim ;
Kurt, Gunes Karabulut .
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
[2]   A Tutorial on Holographic MIMO Communications-Part II: Performance Analysis and Holographic Beamforming [J].
An, Jiancheng ;
Yuen, Chau ;
Huang, Chongwen ;
Debbah, Merouane ;
Poor, H. Vincent ;
Hanzo, Lajos .
IEEE COMMUNICATIONS LETTERS, 2023, 27 (07) :1669-1673
[3]   Making Cell-Free Massive MIMO Competitive With MMSE Processing and Centralized Implementation [J].
Bjornson, Emil ;
Sanguinetti, Luca .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (01) :77-90
[4]   Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks [J].
Chen, Binqiang ;
Liu, Dong ;
Hanzo, Lajos .
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, :3983-3988
[5]   Hierarchical Domain-Based Multicontroller Deployment Strategy in SDN-Enabled Space-Air-Ground Integrated Network [J].
Chen, Chen ;
Liao, Zhan ;
Ju, Ying ;
He, Ci ;
Yu, Keping ;
Wan, Shaohua .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (06) :4864-4879
[6]   THE TRENDS, CHALLENGES, AND KEY TECHNOLOGIES OF BEAM-SPACE MULTIPLEXING IN THE INTEGRATED TERRISTRIAL-SATELLITE COMMUNICATION FOR B5G AND 6G [J].
Chen, Shanzhi ;
Sun, Shaohui ;
Miao, Deshan ;
Shi, Tong ;
Cui, Yupeng .
IEEE WIRELESS COMMUNICATIONS, 2023, 30 (06) :77-86
[7]   Pilot Assignment and Power Control in Secure UAV-Enabled Cell-Free Massive MIMO Networks [J].
Chen, Yong ;
Zhang, Xianyu ;
Yao, Fuqiang ;
An, Kang ;
Zheng, Gan ;
Chatzinotas, Symeon .
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02) :3377-3391
[8]   Space-air-ground integrated network (SAGIN) for 6G: Requirements, architecture and challenges [J].
Cui, Huanxi ;
Zhang, Jun ;
Geng, Yuhui ;
Xiao, Zhenyu ;
Sun, Tao ;
Zhang, Ning ;
Liu, Jiajia ;
Wu, Qihui ;
Cao, Xianbin .
CHINA COMMUNICATIONS, 2022, 19 (02) :90-108
[9]   Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].
Cui, Jingjing ;
Liu, Yuanwei ;
Nallanathan, Arumugam .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743
[10]   Analysis of UAV Communications in Cell-Free Massive MIMO Systems [J].
D'Andrea, Carmen ;
Garcia-Rodriguez, Adrian ;
Geraci, Giovanni ;
Giordano, Lorenzo Galati ;
Buzzi, Stefano .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 :133-147