共 40 条
Exploiting NOMA Transmissions in Multi-UAV-Assisted Wireless Networks: From Aerial-RIS to Mode-Switching UAVs
被引:0
作者:
Zhao, Songhan
[1
,2
]
Gong, Shimin
[1
,2
]
Gu, Bo
[1
,2
]
Li, Lanhua
[1
,2
]
Lyu, Bin
[3
]
Hoang, Dinh Thai
[4
]
Yi, Changyan
[5
]
机构:
[1] Sun Yat sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen 518107, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Emer, Guangzhou 510006, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Sch Commun & Informat Engn, Nanjing 210023, Peoples R China
[4] Univ Technol Sydney, Sch Elect & Data Engn, Ultimo, NSW 2007, Australia
[5] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China
基金:
中国国家自然科学基金;
关键词:
NOMA;
Switches;
Autonomous aerial vehicles;
Wireless networks;
Array signal processing;
Data communication;
Trajectory planning;
Trajectory;
Throughput;
Radio frequency;
Aerial reconfigurable intelligent surface;
UAV-assisted wireless networks;
deep reinforcement learning;
INTELLIGENT;
OPTIMIZATION;
DESIGN;
D O I:
10.1109/TWC.2024.3522249
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
In this paper, we consider an aerial reconfigurable intelligent surface (ARIS)-assisted wireless network, where multiple unmanned aerial vehicles (UAVs) collect data from ground users (GUs) by using the non-orthogonal multiple access (NOMA) method. The ARIS provides enhanced channel controllability to improve the NOMA transmissions and reduce the co-channel interference among UAVs. We also propose a novel dual-mode switching scheme, where each UAV equipped with both an ARIS and a radio frequency (RF) transceiver can adaptively perform passive reflection or active transmission. We aim to maximize the overall network throughput by jointly optimizing the UAVs' trajectory planning and operating modes, the ARIS's passive beamforming, and the GUs' transmission control strategies. We propose an optimization-driven hierarchical deep reinforcement learning (O-HDRL) method to decompose it into a series of subproblems. Specifically, the multi-agent deep deterministic policy gradient (MADDPG) adjusts the UAVs' trajectory planning and mode switching strategies, while the passive beamforming and transmission control strategies are tackled by the optimization methods. Numerical results reveal that the O-HDRL efficiently improves the learning stability and reward performance compared to the benchmark methods. Meanwhile, the dual-mode switching scheme is verified to achieve a higher throughput performance compared to the fixed ARIS scheme.
引用
收藏
页码:2530 / 2544
页数:15
相关论文