3D Deployment and Energy Efficiency Optimization Based on DRL for RIS-Assisted Air-to-Ground Communications Networks

被引：1

作者：

Yao, Yuanyuan ^{[1
,2
]}

Lv, Ke ^{[1
,2
]}

Huang, Sai ^{[3
]}

Xiang, Wei ^{[4
,5
]}

机构：

[1] Beijing Informat Sci & Technol Univ, Key Lab Informat & Commun Syst, Minist Informat Ind, Beijing 100101, Peoples R China

[2] Beijing Informat Sci & Technol Univ, Key Lab Modern Measurement & Control Technol, Minist Educ, Beijing 100101, Peoples R China

[3] Beijing Univ Posts & Telecommun, Key Lab Universal Wireless Commun, Minist Educ, Beijing 100876, Peoples R China

[4] Trobe Univ, Sch Comp Engn & Math Sci, Melbourne, Vic 3086, Australia

[5] James Cook Univ, Coll Sci & Engn, Cairns, Qld 4878, Australia

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2024年 / 73卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Autonomous aerial vehicles; Optimization; Trajectory; Three-dimensional displays; Resource management; Relays; Array signal processing; Unmanned aerial vehicle (UAV); reconfigurable intelligent surface (RIS); deep reinforcement learning (DRL); dueling double deep Q-learning (D3QN); three-dimensional (3D) deployment; RECONFIGURABLE INTELLIGENT SURFACES; UAV COMMUNICATIONS; DESIGN; SKY;

D O I：

10.1109/TVT.2024.3405608

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Research indicates that replacing relays with the reconfigurable intelligent surface (RIS) can effectively reduce energy consumption under certain conditions. Therefore, introducing RIS into unmanned aerial vehicle (UAV) assisted air-to-ground communication networks can further enhance communication performance. In this paper, we propose a RIS-UAV-assisted communication network scenario that utilizes the zero-forcing (ZF) precoding method to eliminate multi-user interference, and then optimize the BS transmit power by using the Dinkelbach algorithm. To address the optimization problem of bandwidth allocation, RIS phase shifts, and three-dimensional (3D) coordinates of the RIS-UAV, we propose two deep reinforcement learning (DRL) algorithms, which are termed D3QN-MM and D3QN-Pure, respectively. Both D3QN-MM and D3QN-Pure utilize the dueling double deep Q-network (D3QN) for optimizing bandwidth allocation and the 3D coordinates of the RIS-UAV. However, D3QN-MM employs the traditional majorize-minimization (MM) algorithm for RIS phase shifts optimization, while D3QN-Pure utilizes the D3QN. By comparing them with other algorithms, such as the DRL algorithm, the advantages of the algorithms proposed in this paper are highlighted. Furthermore, compared to the amplify-and-forward (AF) relay, the RIS can achieve a 48% energy efficiency improvement. Besides, the D3QN-Pure algorithm provided up to 14.3% energy effciency improvement compared to the D3QN-MM algorithm.

引用

页码：14988 / 15003

页数：16