Adaptive Sampling Path Planning for a 3D Marine Observation Platform Based on Evolutionary Deep Reinforcement Learning

被引:4
作者
Zhang, Jingjing [1 ]
Liu, Yanlong [1 ]
Zhou, Weidong [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
关键词
marine environment observation; evolutionary learning; reinforcement learning; path planning; deep learning; AUV;
D O I
10.3390/jmse11122313
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Adaptive sampling of the marine environment may improve the accuracy of marine numerical prediction models. This study considered adaptive sampling path optimization for a three-dimensional (3D) marine observation platform, leading to a path-planning strategy based on evolutionary deep reinforcement learning. The low sampling efficiency of the reinforcement learning algorithm is improved by evolutionary learning. The combination of these two components as a new algorithm has become a current research trend. We first combined the evolutionary algorithm with different reinforcement learning algorithms to verify the effectiveness of the combination of algorithms with different strategies. Experimental results indicate that the fusion of the two algorithms based on a maximum-entropy strategy is more effective for adaptive sampling using a 3D marine observation platform. Data assimilation experiments indicate that adaptive sampling data from a 3D mobile observation platform based on evolutionary deep reinforcement learning improves the accuracy of marine environment numerical prediction systems.
引用
收藏
页数:23
相关论文
共 60 条
[41]  
Ezeora OS, 2016, 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING (EEEIC)
[42]   PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning [J].
Sartoretti, Guillaume ;
Kerr, Justin ;
Shi, YunFei ;
Wagner, Glenn ;
Kumar, T. K. Satish ;
Koenig, Sven ;
Choset, Howie .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) :2378-2385
[43]  
Silver D, 2014, PR MACH LEARN RES, V32
[44]  
Singh Y., 2017, Marine Navigation, P399
[45]  
Solichudin S., 2020, INT J ELECT COMPUTER, V10, P3596, DOI 10.11591/ijece.v10i4.pp3596-3604
[46]   Adaptive sampling with an autonomous underwater vehicle in static marine environments [J].
Stankiewicz, Paul ;
Tan, Yew T. ;
Kobilarov, Marin .
JOURNAL OF FIELD ROBOTICS, 2021, 38 (04) :572-597
[47]   Evolving neural networks through augmenting topologies [J].
Stanley, KO ;
Miikkulainen, R .
EVOLUTIONARY COMPUTATION, 2002, 10 (02) :99-127
[48]  
Tamar A., 2016, Advances in Neural Information Processing Systems, P2154
[49]   A deep-learning-based surrogate model for data assimilation in dynamic subsurface flow problems [J].
Tang, Meng ;
Liu, Yimin ;
Durlofsky, Louis J. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 413
[50]  
Terasawa R, 2020, IEEE INT CONF ROBOT, P9548, DOI [10.1109/ICRA40945.2020.9196883, 10.1109/icra40945.2020.9196883]