Coverage path planning for maritime search and rescue using reinforcement learning

被引:72
作者
Ai, Bo [1 ]
Jia, Maoxin [1 ]
Xu, Hanwen [1 ]
Xu, Jiangling [2 ]
Wen, Zhen [1 ]
Li, Benshuai [1 ,3 ]
Zhang, Dan [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Geodesy & Geomat, Qingdao 266590, Peoples R China
[2] State Ocean Adm, North China Sea Marine Forecasting Ctr, Qingdao 266100, Peoples R China
[3] Qingdao Yuehai Informat Serv Co Ltd, Qingdao 266590, Peoples R China
基金
中国国家自然科学基金;
关键词
Maritime search and rescue; Reinforcement learning; Coverage path planning; Intelligent planning; APPROXIMATION; ALGORITHMS;
D O I
10.1016/j.oceaneng.2021.110098
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In maritime search and rescue (SAR), the planning of the search path will directly affect the efficiency of searching for people overboard in the search area. However, traditional SAR decision-making schemes often adopt a fixed search path planning mode, but the limits are poor flexibility, low efficiency, and insufficient intelligence. This paper plans a search path with the shortest time-consuming and priority coverage of highprobability areas, considering complete coverage of maritime SAR areas and avoiding maritime obstacles. Firstly, a maritime SAR environment model is built using marine environmental field data and electronic charts. Secondly, an autonomous coverage path planning model for maritime SAR is proposed based on reinforcement learning, in which a reward function with multiple constraints is designed to guide the navigation action of the vessel agent. In the iterative training process of the path planning model, the random action selection probability is dynamically adjusted by the nonlinear action selection policy to ensure the stable convergence of the model. Finally, the experimental verification is conducted in different small-scale maritime SAR simulation scenarios. The results indicate that the search path can cover the high-probability areas preferentially with lower repeated coverage and shorter path length compared with other path planning algorithms.
引用
收藏
页数:11
相关论文
共 36 条
[1]   A Constraint Optimization Approach for the Allocation of Multiple Search Units in Search and Rescue Operations [J].
Abi-Zeid, Irene ;
Nilo, Oscar ;
Lamontagne, Luc .
INFOR, 2011, 49 (01) :15-30
[2]   Path planning for robotic demining: Robust sensor-based coverage of unstructured environments and probabilistic methods [J].
Acar, EU ;
Choset, H ;
Zhang, YG ;
Schervish, M .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2003, 22 (7-8) :441-466
[3]   An Intelligent Decision Algorithm for the Generation of Maritime Search and Rescue Emergency Response Plans [J].
Ai, Bo ;
Li, Benshuai ;
Gao, Song ;
Xu, Jiangling ;
Shang, Hengshuai .
IEEE ACCESS, 2019, 7 :155835-155850
[4]   Approximation algorithms for lawn mowing and milling [J].
Arkin, EM ;
Fekete, SP ;
Mitchell, JSB .
COMPUTATIONAL GEOMETRY-THEORY AND APPLICATIONS, 2000, 17 (1-2) :25-50
[5]  
Balch T, 2000, P WORKSH SENS MOT IE
[6]   An operational search and rescue model for the Norwegian Sea and the North Sea [J].
Breivik, Oyvind ;
Allen, Arthur A. .
JOURNAL OF MARINE SYSTEMS, 2008, 69 (1-2) :99-113
[7]  
Chae H, 2017, IEEE INT C INTELL TR
[8]  
Choset H., 1997, P INT C FIELD SERV R
[9]  
Frost J., 1999, Response, V17, P1
[10]   A survey on coverage path planning for robotics [J].
Galceran, Enric ;
Carreras, Marc .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (12) :1258-1276