RL-based path planning for an over-actuated floating vehicle under disturbances

被引:16
作者
Blekas, Konstantinos [1 ]
Vlachos, Kostas [1 ]
机构
[1] Univ Ioannina, Dept Comp Sci & Engn, GR-45110 Ioannina, Greece
关键词
Reinforcement learning; Over-actuated control; Marine vehicle; Autonomous navigation;
D O I
10.1016/j.robot.2017.12.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates the use of reinforcement learning for the path planning of an autonomous triangular marine platform in unknown environments under various environmental disturbances. The marine platform is over-actuated, i.e. it has more control inputs than degrees of freedom. The proposed approach uses a high-level online least-squared policy iteration scheme for value function approximation in order to estimate sub-optimal policy. The chosen action is considered as the desired input to a fast and efficient low-level velocity controller. We evaluate our approach in a simulated environment, including the dynamic model of the platform, the dynamics and limitations of the actuators, and the presence of wind, wave, and sea current disturbances. Simulation results are presented that demonstrate the performance of the proposed approach. Despite the model dynamics, the actuation dynamics and constrains, and the environmental disturbances, the presented results are promising. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 21 条
[1]  
[Anonymous], 2005, P 22 INT C MACH LEAR, DOI DOI 10.1145/1102351.1102377
[2]  
[Anonymous], 1965, HOERNER FLUID DYNAMI
[3]  
[Anonymous], 2020, Reinforcement Learning, An Introduction
[4]  
Antonelli D, 2014, SPRINGER TRAC ADV RO, V94, P47, DOI 10.1007/978-3-319-02934-4_3
[5]  
Busoniu L, 2010, P AMER CONTR CONF, P486
[6]   A behavior-based scheme using reinforcement learning for autonomous underwater vehicles [J].
Carreras, M ;
Yuh, J ;
Batlle, J ;
Ridao, P .
IEEE JOURNAL OF OCEANIC ENGINEERING, 2005, 30 (02) :416-427
[7]  
Fossen Thor I, 1994, Guidance and Control of Ocean Vehicles
[8]  
Kawano H., 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, P996
[9]   Least-squares policy iteration [J].
Lagoudakis, MG ;
Parr, R .
JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (06) :1107-1149
[10]  
Schuitema E., 2010, P 22 BEN C ART INT