Deep Reinforcement Learning and NOMA-Based Multi-Objective RIS-Assisted IS-UAV-TNs: Trajectory Optimization and Beamforming Design

被引:63
作者
Guo, Kefeng [1 ]
Wu, Min [1 ]
Li, Xingwang [2 ]
Song, Houbing [3 ]
Kumar, Neeraj [4 ,5 ,6 ,7 ,8 ]
机构
[1] Space Engn Univ, Sch Space Informat, Beijing 101407, Peoples R China
[2] Henan Polytech Univ, Sch Phys & Elect Informat Engn, Jiaozuo 454000, Peoples R China
[3] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
[4] Deemed Univ, Thapar Inst Engn & Technol, Dept Comp Sci & Engn, Patiala 147004, India
[5] Univ Petr & Energy Studies, Sch Comp Sci, Dehra Dun 248007, India
[6] Lebanese Amer Univ, Dept Elect & Comp Engn, Beirut 11022801, Lebanon
[7] Chandigarh Univ, Comp Sci & Engn Dept, Mohali 160012, India
[8] King Abdulaziz Univ, Fac Comp & IT, Jeddah 21589, Saudi Arabia
基金
中国国家自然科学基金;
关键词
NOMA; Optimization; Wireless communication; Autonomous aerial vehicles; Array signal processing; Deep learning; Satellite broadcasting; Deep reinforcement learning (DRL); reconfigurable intelligent surface (RIS); integrated satellite-unmanned aerial vehicle-terrestrial networks (IS-UAV-TNs); non-orthogonal multiple access (NOMA); multi-objective DDPG; NONORTHOGONAL MULTIPLE-ACCESS; PHYSICAL LAYER SECURITY; SECRECY PERFORMANCE; OUTAGE PROBABILITY; NETWORKS; SYSTEMS; RELAY; IRS; ALLOCATION;
D O I
10.1109/TITS.2023.3267607
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In this paper, we discuss the co-optimized performance of multi-reconfigurable intelligent surface (RIS)-assisted integrated satellite-unmanned aerial vehicle-terrestrial network (IS-UAV-TN), where the multiple vehicle users are applied to the network under consideration. The performance optimization of IS-UAV-TNs faces two major challenges: one is the obstacles in the transmission path and the other is the highly dynamic communication environment caused by the UAV movement for the multiple ground vehicle users. To tackle these above issues efficiently, we will install RIS on the UAV for the purpose of reshaping the wireless transmission path. In addition, non-orthogonal multiple access (NOMA) protocols are considered as a new paradigm to address spectrum shortage and enhance connection quality. Considering the UAV energy consumption, the satellite transmission beamforming matrix and RIS phase shift configuration, a multi-objective optimization problem is proposed to maximize the system achievable rate and minimize the UAV energy consumption during a specific mission. On this foundation, to facilitate the online decision problem, the deep reinforcement learning (DRL) algorithm is utilized to achieve real-time interaction with the communication environment. A multi-objective deep deterministic policy gradient (MO-DDPG) algorithm is proposed to search for sub-optimal solutions about the learning problem of multi-objective control policies in IS-UAV-TNs. Experimental results show that the method can simultaneously consider three optimization objectives and effectively adjust the optimal update policy according to the settings of different weight parameters.
引用
收藏
页码:10197 / 10210
页数:14
相关论文
共 56 条
[1]   Outage Performance of Cognitive Hybrid Satellite-Terrestrial Networks With Interference Constraint [J].
An, Kang ;
Lin, Min ;
Zhu, Wei-Ping ;
Huang, Yongming ;
Zheng, Gan .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (11) :9397-9404
[2]   Secure Transmission in Cognitive Satellite Terrestrial Networks [J].
An, Kang ;
Lin, Min ;
Ouyang, Jian ;
Zhu, Wei-Ping .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2016, 34 (11) :3025-3037
[3]   Channel Estimation and Detection in Satellite Communication Systems [J].
Arti, M. K. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2016, 65 (12) :10173-10179
[4]   Multi-IRS and Multi-UAV-Assisted MEC System for 5G/6G Networks: Efficient Joint Trajectory Optimization and Passive Beamforming Framework [J].
Asim, Muhammad ;
ELAffendi, Mohammed ;
Abd El-Latif, Ahmed A. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (04) :4553-4564
[5]   A Survey on Non-Orthogonal Multiple Access for 5G Networks: Research Challenges and Future Trends [J].
Ding, Zhiguo ;
Lei, Xianfu ;
Karagiannidis, George K. ;
Schober, Robert ;
Yuan, Jinhong ;
Bhargava, Vijay K. .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2017, 35 (10) :2181-2195
[6]   Application of Non-Orthogonal Multiple Access in LTE and 5G Networks [J].
Ding, Zhiguo ;
Liu, Yuanwei ;
Choi, Jinho ;
Sun, Qi ;
Elkashlan, Maged ;
I, Chih-Lin ;
Poor, H. Vincent .
IEEE COMMUNICATIONS MAGAZINE, 2017, 55 (02) :185-191
[7]   Integrated Satellite Multiple Two-Way Relay Networks: Secrecy Performance Under Multiple Eves and Vehicles With Non-Ideal Hardware [J].
Guo, Kefeng ;
Li, Xingwang ;
Alazab, Mamoun ;
Jhaveri, Rutvij H. ;
An, Kang .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (02) :1307-1318
[8]   Performance Analysis of Hybrid Satellite-Terrestrial Cooperative Networks With Relay Selection [J].
Guo, Kefeng ;
Lin, Min ;
Zhang, Bangning ;
Wang, Jun-Bo ;
Wu, Yongpeng ;
Zhu, Wei-Ping ;
Cheng, Julian .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (08) :9053-9067
[9]   On the Performance of LMS Communication With Hardware Impairments and Interference [J].
Guo, Kefeng ;
Lin, Min ;
Zhang, Bangning ;
Zhu, Wei-Ping ;
Wang, Jun-Bo ;
Tsiftsis, Theodoros A. .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (02) :1490-1505
[10]   Epileptogenic zone localization using a new automatic quantitative analysis based on normal brain glucose metabolism database [J].
Guo, Kun ;
Yuan, Menghui ;
Wei, Longxiao ;
Lu, Jie .
INTERNATIONAL JOURNAL OF NEUROSCIENCE, 2021, 131 (02) :128-134