A path planning strategy for marine vehicles based on deep reinforcement learning and data-driven dynamic flow fields prediction

被引:1
作者
Sang, Qiming [1 ,2 ,3 ]
Tian, Yu [1 ,2 ]
Jin, Qianlong [1 ,2 ,3 ]
Yu, Jiancheng [1 ,2 ]
机构
[1] Chinese Acad Sci, State Key Lab Robot, Shenyang Inst Automat, Shenyang 110169, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110169, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
2021 6TH INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTICS ENGINEERING, CACRE | 2021年
基金
中国国家自然科学基金;
关键词
marine vehicle; path planning; deep reinforcement learning; dynamic mode decomposition; sensing optimization; OPTIMIZATION;
D O I
10.1109/CACRE52464.2021.9501367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a strategy for planning a path of a marine vehicle in dynamic flow fields. This strategy composes of two modules: deep reinforcement learning based path planning and dynamic mode decomposition (DMD) based flow fields prediction. The path planning module employs the deep reinforcement learning algorithm of proximal policy optimization (PPO) to implement the time-optimal path planning of a marine vehicle in predicted spatially-temporally dynamic flow fields, where the long short-term memory (LSTM) is introduced to address the partially observable issue. The objective of the flow prediction module is to provide the path planning module with predicted dynamic flow fields. In the flow prediction module, the data-driven method of DMD is used to learn the low-dimensional model of flow dynamics and make future predictions. And a network of marine vehicles with flow sensing capability are adopted to generate data of flow fields for the on-line DMD learning and prediction, where their flow sensing locations are optimized by the deep reinforcement learning algorithm of deep-Q learning with the aim at minimizing the reconstruction error of the flow field with the sparse in-situ point flow observations by the swarm of marine vehicles. The strategy is implemented in computer simulations, where the flow data outputted by a numerical ocean model is utilized to test the strategy. The simulation results demonstrate the performance of the proposed strategy.
引用
收藏
页码:466 / 471
页数:6
相关论文
共 22 条
[1]   Progress toward autonomous ocean sampling networks [J].
Curtin, Thomas B. ;
Bellingham, James G. .
DEEP-SEA RESEARCH PART II-TOPICAL STUDIES IN OCEANOGRAPHY, 2009, 56 (3-5) :62-67
[2]   Mission Planning and Decision Support for Underwater Glider Networks: A Sampling on-Demand Approach [J].
Ferri, Gabriele ;
Cococcioni, Marco ;
Alvarez, Alberto .
SENSORS, 2016, 16 (01)
[3]   Information-driven robotic sampling in the coastal ocean [J].
Fossum, Trygve Olav ;
Eidsvik, Jo ;
Ellingsen, Ingrid ;
Alver, Morten Omholt ;
Fragoso, Glaucia Moreira ;
Johnsen, Geir ;
Mendes, Renato ;
Ludvigsen, Martin ;
Rajan, Kanna .
JOURNAL OF FIELD ROBOTICS, 2018, 35 (07) :1101-1121
[4]  
Garau B, 2005, IEEE INT CONF ROBOT, P194
[5]   Geostatistical Prediction of Ocean Outfall Plume Characteristics Based on an Autonomous Underwater Vehicle Regular Paper [J].
Gregorio Ramos, Patrcia Alexandra .
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
[6]   Nonlinear optimization of autonomous undersea vehicle sampling strategies for oceanographic data-assimilation [J].
Heaney, Kevin D. ;
Gawarkiewicz, Glen ;
Duda, Timothy F. ;
Lermusiaux, Pierre F. J. .
JOURNAL OF FIELD ROBOTICS, 2007, 24 (06) :437-448
[7]  
Leonard NE, 2016, SPRINGER HANDBOOK OF OCEAN ENGINEERING, P441
[8]  
Lermusiaux PFJ, 2016, SPRINGER HANDBOOK OF OCEAN ENGINEERING, P481
[9]   Real-time Modelling of Tidal Current for Navigating Underwater Glider Sensing Networks [J].
Liang, Xiaolin ;
Wu, Wencen ;
Chang, Dongsik ;
Zhang, Fumin .
ANT 2012 AND MOBIWIS 2012, 2012, 10 :1121-1126
[10]   Time-optimal path planning in dynamic flows using level set equations: theory and schemes [J].
Lolla, Tapovan ;
Lermusiaux, Pierre F. J. ;
Ueckermann, Mattheus P. ;
Haley, Patrick J., Jr. .
OCEAN DYNAMICS, 2014, 64 (10) :1373-1397