Joint Optimization of Trajectory and User Association via Reinforcement Learning for UAV-Aided Data Collection in Wireless Networks

被引：30

作者：

Chen, Gong ^{[1
,2
,3
]}

Zhai, Xiangping Bryce ^{[1
,2
]}

Li, Congduan ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China

[2] Collaborat Innovat Ctr Novel Software Technol & In, Nanjing 210023, Jiangsu, Peoples R China

[3] Sun Yat Sen Univ, Sch Elect & Commun Engn, Shenzhen 518107, Peoples R China

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2023年 / 22卷 / 05期

基金：

美国国家科学基金会;

关键词：

Trajectory; Optimization; Games; Throughput; Wireless networks; Resource management; Interference; UAV trajectory design; fair throughputs; energy-efficiency; coalition formation games; multi-agent deep reinforcement learning; ENERGY-EFFICIENT; COMMUNICATION; ALLOCATION; DESIGN; SPECTRUM; MEC;

D O I：

10.1109/TWC.2022.3216049

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Unmanned Aerial Vehicles (UAVs) can be used as aerial base stations for data collection in next-generation wireless networks due to their high adaptability and maneuverability. This paper investigates the scenario where multiple UAVs cooperatively fly over heterogeneous ground users (GUs) and collect data without a central controller. With the consideration of signal-to-interference-and-noise ratio (SINR) and fairness among users, we jointly optimize the trajectories of UAVs and the GUs associations to maximize the total throughput and energy efficiency. We formulate the long-term optimization problem as a decentralized partially observed Markov decision processes (DEC-POMDP) and derive an approach combining the coalition formation game (CFG) and multi-agent deep reinforcement learning (MADRL). We first formulate the discrete association scheduling problem as a non-cooperative theoretical game and use the CFG algorithm to achieve a decentralized scheme converging to Nash equilibrium (NE). Then, a MARL-based technique is developed to optimize the trajectories and energy consumption continuously in a centralized-training but decentralized-execution manner. Simulation results demonstrate that the proposed algorithm outperforms the commonly used schemes in the literature, regarding the fair throughput and energy consumption in a distributed manner.

引用

页码：3128 / 3143

页数：16

共 39 条

[1] Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications [J].

Al-Fuqaha, Ala ;

Guizani, Mohsen ;

Mohammadi, Mehdi ;

Aledhari, Mohammed ;

Ayyash, Moussa .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04) :2347-2376

[2] Optimal LAP Altitude for Maximum Coverage [J].

Al-Hourani, Akram ;

Kandeepan, Sithamparanathan ;

Lardner, Simon .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572

[3] Joint Task Assignment and Spectrum Allocation in Heterogeneous UAV Communication Networks: A Coalition Formation Game-Theoretic Approach [J].

Chen, Jiaxin ;

Wu, Qihui ;

Xu, Yuhua ;

Qi, Nan ;

Guan, Xin ;

Zhang, Yuli ;

Xue, Zhen .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :440-452

[4] Deep Reinforcement Learning for Internet of Things: A Comprehensive Survey [J].

Chen, Wuhui ;

Qiu, Xiaoyu ;

Cai, Ting ;

Dai, Hong-Ning ;

Zheng, Zibin ;

Zhang, Yan .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (03) :1659-1692

[5] UAV-Relaying-Assisted Secure Transmission With Caching [J].

Cheng, Fen ;

Gui, Guan ;

Zhao, Nan ;

Chen, Yunfei ;

Tang, Jie ;

Sari, Hikmet .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (05) :3140-3153

[6] Trajectory Design and Access Control for Air-Ground Coordinated Communications System With Multiagent Deep Reinforcement Learning [J].

Ding, Ruijin ;

Xu, Yadong ;

Gao, Feifei ;

Shen, Xuemin .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08) :5785-5798

[7] 3D UAV Trajectory Design and Frequency Band Allocation for Energy-Efficient and Fair Communication: A Deep Reinforcement Learning Approach [J].

Ding, Ruijin ;

Gao, Feifei ;

Shen, Xuemin .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (12) :7796-7809

[8] Energy-Efficient UAV-Enabled Data Collection via Wireless Charging: A Reinforcement Learning Approach [J].

Fu, Shu ;

Tang, Yujie ;

Wu, Yuan ;

Zhang, Ning ;

Gu, Huaxi ;

Chen, Chen ;

Liu, Min .

IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (12) :10209-10219

[9] Energy-Aware Trajectory Design for Outage Minimization in UAV-Assisted Communication Systems [J].

Gupta, Nishant ;

Mishra, Deepak ;

Agarwal, Satyam .

IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (03) :1751-1763

[10]

Ji J., 2022, IEEE Trans. Mob. Comput.

← 1 2 3 4 →