Rule-Guided DRL for UAV-Assisted Wireless Sensor Networks With No-Fly Zones Safety

被引:0
作者
Bai, Zixuan [1 ]
Shi, Jia [1 ]
Li, Zan [1 ]
Li, Meng [2 ]
Chen, Kwang-Cheng [3 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Xi An Jiao Tong Univ, Inst Hlth & Rehabil Sci, Sch Life Sci & Technol, Xian 710049, Peoples R China
[3] Univ S Florida, Dept Elect Engn, Tampa, FL 33620 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Safety; Autonomous aerial vehicles; Wireless sensor networks; Data collection; Data models; Packet loss; Training; Wireless sensor network (WSN); unmanned aerial vehicle (UAV); data collection; age of information (AoI); safe reinforcement learning; DATA-COLLECTION; PERFORMANCE ANALYSIS; INFORMATION; AGE; MINIMIZATION; INTERNET; DESIGN; MODEL;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Essential safety constraints are critical to data collection in UAV-assisted wireless sensor networks (WSNs). This paper proposes a dynamic model for such WSNs considering safety of no-fly zones (NFZs) and uncertainty of sensor data. Under the dual objectives of minimizing the age of information (AoI) and reducing packet loss, we formulate the UAV trajectory optimization problem with NFZ constraints as a constraint Markov decision process (CMDP). To integrate safety into the exploring and learning process, we propose a rule-guided deep reinforcement learning (RG-DRL) scheme, providing safety guarantee and robust performance for the off-policy DRL agent. On one hand, a rule-based AI method is employed to strategically guide the convergence of the DRL agent. On the other hand, a novel safety technique named differentiable algorithm enabled safety layer (DASLayer) is introduced to navigate the UAV in complex environments without violating safety constraints. By relaxing the artificial potential field (APF) algorithm into a differentiable manner, the DASLayer is compatible with common neural architectures, facilitating simultaneous training alongside the neural network. Statistical results demonstrate the effectiveness of the proposed method, ensuring zero safety constraint violations, and yielding significant improvements in AoI (25%) and packet loss reduction (30%).
引用
收藏
页码:1268 / 1280
页数:13
相关论文
共 66 条
[1]   Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks [J].
Abd-Elmagid, Mohamed A. ;
Dhillon, Harpreet S. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) :2003-2008
[2]  
Achiam J, 2017, PR MACH LEARN RES, V70
[3]  
Al-Anbagi I., 2013, 2013 International Conference on Computing, Networking and Communications (ICNC 2013), P802, DOI 10.1109/ICCNC.2013.6504191
[4]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[5]  
Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
[6]  
Altman E., 2021, Constrained Markov Decision Processes
[7]   Age of Information minimization in UAV-aided data collection for WSN and IoT applications: A systematic review [J].
Amodu, Oluwatosin Ahmed ;
Bukar, Umar Ali ;
Mahmood, Raja Azlina Raja ;
Jarray, Chedia ;
Othman, Mohamed .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2023, 216
[8]   THE IF-PROBLEM IN AUTOMATIC DIFFERENTIATION [J].
BECK, T ;
FISCHER, H .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1994, 50 (1-3) :119-131
[9]   Efficient 3-D Placement of an Aerial Base Station in Next Generation Cellular Networks [J].
Bor-Yaliniz, R. Irem ;
El-Keyi, Amr ;
Yanikomeroglu, Haiti .
2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
[10]   Internet of Things (IoT) and Agricultural Unmanned Aerial Vehicles (UAVs) in smart farming: A comprehensive review [J].
Boursianis, Achilles D. ;
Papadopoulou, Maria S. ;
Diamantoulakis, Panagiotis ;
Liopa-Tsakalidi, Aglaia ;
Barouchas, Pantelis ;
Salahas, George ;
Karagiannidis, George ;
Wan, Shaohua ;
Goudos, Sotirios K. .
INTERNET OF THINGS, 2022, 18