Adaptive Sampling Path Planning for a 3D Marine Observation Platform Based on Evolutionary Deep Reinforcement Learning

被引:4
作者
Zhang, Jingjing [1 ]
Liu, Yanlong [1 ]
Zhou, Weidong [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin 150001, Peoples R China
关键词
marine environment observation; evolutionary learning; reinforcement learning; path planning; deep learning; AUV;
D O I
10.3390/jmse11122313
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Adaptive sampling of the marine environment may improve the accuracy of marine numerical prediction models. This study considered adaptive sampling path optimization for a three-dimensional (3D) marine observation platform, leading to a path-planning strategy based on evolutionary deep reinforcement learning. The low sampling efficiency of the reinforcement learning algorithm is improved by evolutionary learning. The combination of these two components as a new algorithm has become a current research trend. We first combined the evolutionary algorithm with different reinforcement learning algorithms to verify the effectiveness of the combination of algorithms with different strategies. Experimental results indicate that the fusion of the two algorithms based on a maximum-entropy strategy is more effective for adaptive sampling using a 3D marine observation platform. Data assimilation experiments indicate that adaptive sampling data from a 3D mobile observation platform based on evolutionary deep reinforcement learning improves the accuracy of marine environment numerical prediction systems.
引用
收藏
页数:23
相关论文
共 60 条
[51]   Robust Position Control of an Over-actuated Underwater Vehicle under Model Uncertainties and Ocean Current Effects Using Dynamic Sliding Mode Surface and Optimal Allocation Control [J].
Vu, Mai The ;
Le, Tat-Hien ;
Thanh, Ha Le Nhu Ngoc ;
Huynh, Tuan-Tu ;
Van, Mien ;
Hoang, Quoc-Dong ;
Do, Ton Duc .
SENSORS, 2021, 21 (03) :1-25
[52]   Sliding mode heading control for AUV based on continuous hybrid model-free and model-based reinforcement learning [J].
Wang, Dianrui ;
Shen, Yue ;
Wan, Junhe ;
Sha, Qixin ;
Li, Guangliang ;
Chen, Guanzhong ;
He, Bo .
APPLIED OCEAN RESEARCH, 2022, 118
[53]   Cooperative collision avoidance for unmanned surface vehicles based on improved genetic algorithm [J].
Wang, Hongjian ;
Fu, Zhongjian ;
Zhou, Jiajia ;
Fu, Mingyu ;
Ruan, Li .
OCEAN ENGINEERING, 2021, 222
[54]   Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning [J].
Wang, Shuwu ;
Ma, Feng ;
Yan, Xinping ;
Wu, Peng ;
Liu, Yuanchang .
APPLIED OCEAN RESEARCH, 2021, 110
[55]   State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment [J].
Wang, Zhuo ;
Sui, Yancheng ;
Qin, Hongde ;
Lu, Hao .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
[56]   Autonomous Underwater Vehicle Path Planning Method of Soft Actor-Critic Based on Game Training [J].
Wang, Zhuo ;
Lu, Hao ;
Qin, Hongde ;
Sui, Yancheng .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (12)
[57]   MARKOV DECISION-PROCESSES [J].
WHITE, CC ;
WHITE, DJ .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1989, 39 (01) :1-16
[58]  
Wiering M. A., 2012, Adaptation, Learning, and Optimization, V12, P729, DOI DOI 10.1007/978-3-642-27645-3
[59]  
Zhang Yi, 2020, P 19 INT C AUT AG MU, P2077
[60]   Improved Multi-Agent Deep Deterministic Policy Gradient for Path Planning-Based Crowd Simulation [J].
Zheng, Shangfei ;
Liu, Hong .
IEEE ACCESS, 2019, 7 :147755-147770