Reinforcement-Learning-Aided Safe Planning for Aerial Robots to Collect Data in Dynamic Environments

被引:17
作者
Khamidehi, Behzad [1 ]
Sousa, Elvino S. [1 ]
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Autonomous aerial vehicles; Sensors; Internet of Things; Data collection; Heuristic algorithms; Batteries; Wireless sensor networks; Deep reinforcement learning (RL); Internet of Things (IoT); path planning; unmanned aerial vehicles (UAVs); COMPLETION-TIME MINIMIZATION; ASSISTED DATA-COLLECTION; TRAJECTORY DESIGN; RESOURCE-ALLOCATION; UAV; COMMUNICATION; NETWORKS; SKY; LTE;
D O I
10.1109/JIOT.2022.3145008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the data collection problem in an Internet of Things (IoT) network where an unmanned aerial vehicle (UAV) is utilized to aggregate data from a set of IoT devices. We formulate the scheduling and path planning problems for the UAV. The goal of the scheduling problem is to find the sequence of nodes that the UAV will visit to complete the data collection task in the shortest possible time, ensuring that it does not run out of energy during its mission. We express this problem as a mixed-integer nonlinear problem and propose an efficient algorithm to solve the aforementioned NP-hard problem in polynomial time. Path planning problem aims to find a collision-free path for the UAV. While the state-of-the-art schemes have focused on solving the path planning problem in static environments, we study the problem in a dynamic environment with moving obstacles. We develop an algorithm that works on both static and dynamic environments. Our method combines deep reinforcement learning (RL) with graph-based global path planning algorithms to find a collision-free path for the UAV. One important advantage of our RL-based method over the existing studies is its map independency, which allows us to transform the agent's learning from one environment to another. Via simulation studies, we show that our method is significantly effective in improving the safety of the path planning algorithms in dynamic environments.
引用
收藏
页码:13901 / 13912
页数:12
相关论文
共 48 条
[1]   Optimal LAP Altitude for Maximum Coverage [J].
Al-Hourani, Akram ;
Kandeepan, Sithamparanathan ;
Lardner, Simon .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2014, 3 (06) :569-572
[2]  
[Anonymous], 2014, GUR OPT REF MAN
[3]   Optimal UAV Route in Wireless Charging Sensor Networks [J].
Baek, Jaeuk ;
Han, Sang Ik ;
Han, Youngnam .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (02) :1327-1335
[4]   Energy-Efficient UAV Routing for Wireless Sensor Networks [J].
Baek, Jaeuk ;
Han, Sang Ik ;
Han, Youngnam .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (02) :1741-1750
[5]   The multiple traveling salesman problem: an overview of formulations and solution procedures [J].
Bektas, T .
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2006, 34 (03) :209-219
[6]   A UAV-Assisted Data Collection for Wireless Sensor Networks: Autonomous Navigation and Scheduling [J].
Bouhamed, Omar ;
Ghazzai, Hakim ;
Besbes, Hichem ;
Massoud, Yehia .
IEEE ACCESS, 2020, 8 :110446-110460
[7]  
Cui HG, 2019, IEEE INT CONF ROBOT, P2090, DOI [10.1109/icra.2019.8793868, 10.1109/ICRA.2019.8793868]
[8]   Energy-Efficient Multi-UAV Data Collection for IoT Networks with Time Deadlines [J].
Ghdiri, Oussama ;
Jaafar, Wael ;
Alfattani, Safwan ;
Ben Abderrazak, Jihene ;
Yanikomeroglu, Halim .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[9]   Flight Time Minimization of UAV for Data Collection Over Wireless Sensor Networks [J].
Gong, Jie ;
Chang, Tsung-Hui ;
Shen, Chao ;
Chen, Xiang .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (09) :1942-1954
[10]   Outage and Throughput Performance of Half/Full-Duplex UAV-Assisted Co-Operative Relay Networks Over Weibull Fading Channel [J].
Jeganathan, Anandpushparaj ;
Mitali, Gupta ;
Jayakody, Dushantha Nalin K. ;
Muthuchidambaranathan, P. .
WIRELESS PERSONAL COMMUNICATIONS, 2021, 120 (03) :2389-2407