Deep Reinforcement Learning for Joint Trajectory Planning, Transmission Scheduling, and Access Control in UAV-Assisted Wireless Sensor Networks

被引：9

作者：

Luo, Xiaoling ^{[1
,2
]}

Chen, Che ^{[3
,4
]}

Zeng, Chunnian ^{[1
]}

Li, Chengtao ^{[2
]}

Xu, Jing ^{[5
]}

Gong, Shimin ^{[4
]}

机构：

[1] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China

[2] China Three Gorges Corp, Wuhan 430010, Peoples R China

[3] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Peoples R China

[4] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen 518107, Peoples R China

[5] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China

来源：

SENSORS | 2023年 / 23卷 / 10期

关键词：

UAV; multi-agent deep reinforcement learning; trajectory planning; access control; RESOURCE-ALLOCATION; DESIGN; COMMUNICATION; OPTIMIZATION; COVERAGE; INTERNET;

D O I：

10.3390/s23104691

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Unmanned aerial vehicles (UAVs) can be used to relay sensing information and computational workloads from ground users (GUs) to a remote base station (RBS) for further processing. In this paper, we employ multiple UAVs to assist with the collection of sensing information in a terrestrial wireless sensor network. All of the information collected by the UAVs can be forwarded to the RBS. We aim to improve the energy efficiency for sensing-data collection and transmission by optimizing UAV trajectory, scheduling, and access-control strategies. Considering a time-slotted frame structure, UAV flight, sensing, and information-forwarding sub-slots are confined to each time slot. This motivates the trade-off study between UAV access-control and trajectory planning. More sensing data in one time slot will take up more UAV buffer space and require a longer transmission time for information forwarding. We solve this problem by a multi-agent deep reinforcement learning approach that takes into consideration a dynamic network environment with uncertain information about the GU spatial distribution and traffic demands. We further devise a hierarchical learning framework with reduced action and state spaces to improve the learning efficiency by exploiting the distributed structure of the UAV-assisted wireless sensor network. Simulation results show that UAV trajectory planning with access control can significantly improve UAV energy efficiency. The hierarchical learning method is more stable in learning and can also achieve higher sensing performance.

引用

页数：22

共 37 条

[21] Deep Reinforcement Learning Based Three-Dimensional Area Coverage With UAV Swarm [J].

Mou, Zhiyu ;

Zhang, Yu ;

Gao, Feifei ;

Wang, Huangang ;

Zhang, Tao ;

Han, Zhu .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (10) :3160-3176

[22] Joint Multi-Domain Resource Allocation and Trajectory Optimization in UAV-Assisted Maritime IoT Networks [J].

Qian, Li Ping ;

Zhang, Hongsen ;

Wang, Qian ;

Wu, Yuan ;

Lin, Bin .

IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) :539-552

[23] Performance Analysis and Optimization of RSMA Enabled UAV-Aided IBL and FBL Communication With Imperfect SIC and CSI [J].

Singh, Sandeep Kumar ;

Agrawal, Kamal ;

Singh, Keshav ;

Chen, Yen-Ming ;

Li, Chih-Peng .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (06) :3714-3732

[24] Ergodic Capacity and Placement Optimization for RSMA-Enabled UAV-Assisted Communication [J].

Singh, Sandeep Kumar ;

Agrawal, Kamal ;

Singh, Keshav ;

Li, Chih-Peng .

IEEE SYSTEMS JOURNAL, 2023, 17 (02) :2586-2589

[25] Non-Orthogonal Multiple Access for Unmanned Aerial Vehicle Assisted Communication [J].

Sohail, Muhammad Farhan ;

Leow, Chee Yen ;

Won, Seunghwan .

IEEE ACCESS, 2018, 6 :22716-22727

[26] UAV Relay-Assisted Emergency Communications in IoT Networks: Resource Allocation and Trajectory Optimization [J].

Tran, Dinh-Hieu ;

Nguyen, Van-Dinh ;

Chatzinotas, Symeon ;

Vu, Thang X. ;

Ottersten, Bjorn .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) :1621-1637

[27]

Wang M., 2021, 2021 IEEE 23 INT C H, P961

[28] Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach [J].

Wang, Yang ;

Gao, Zhen ;

Zhang, Jun ;

Cao, Xianbin ;

Zheng, Dezhi ;

Gao, Yue ;

Ng, Derrick Wing Kwan ;

Di Renzo, Marco .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (05) :3899-3912

[29] Distributed Federated Deep Reinforcement Learning Based Trajectory Optimization for Air-Ground Cooperative Emergency Networks [J].

Wu, Silei ;

Xu, Wenjun ;

Wang, Fengyu ;

Li, Guojun ;

Pan, Miao .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (08) :9107-9112

[30] Energy-Efficient UAV Backscatter Communication With Joint Trajectory Design and Resource Optimization [J].

Yang, Gang ;

Dai, Rao ;

Liang, Ying-Chang .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (02) :926-941

← 1 2 3 4 →