Improved reinforcement learning for collision-free local path planning of dynamic obstacle

被引:18
作者
Yang, Xiao [1 ]
Han, Qilong [1 ]
机构
[1] Harbin Engn Univ, Sch Comp Sci & Technol, Nantong St, Harbin 150001, Heilongjiang, Peoples R China
关键词
Artificial intelligence; Intelligent ship; Dynamic obstacle avoidance and path planning; Exploration strategies; Actor-critic; AVOIDANCE; ALGORITHM; SIMULATION; SHIPS; MODEL;
D O I
10.1016/j.oceaneng.2023.115040
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
The rapid development of artificial intelligence has driven the transformation of traditional industry. The proposal for an intelligent ship reflects the development of the shipping industry in the direction of intelligence. Safe maritime transportation is an essential branch of the intelligent ship industry. In this context, applying various intelligent algorithms in ship dynamic obstacle avoidance has attracted the attention and discussion of scholars. Due to the complexity and diversity of ship navigation, environments cannot be described by a definite mathematical model. Reinforcement learning has certain advantages in solving path planning in a complex environment. Traditional mathematical algorithms and swarm intelligence algorithms need mathematical models to constrain boundary conditions when solving complex problems. However, some complex dynamic environments in the real world cannot define mathematical models. This paper optimizes the parameter update and exploration strategies of the actor-critic algorithm in reinforcement learning to improve the algorithm's performance. Under complex meteorological conditions, improved reinforcement learning is applied to the dynamic ship offshore channel simulation environment to verify the algorithm's performance.
引用
收藏
页数:13
相关论文
共 55 条
[1]   A survey of safe landing zone detection techniques for autonomous unmanned aerial vehicles (UAVs) [J].
Alam, Md Shah ;
Oluoch, Jared .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 179
[2]   Ship path planning based on Deep Reinforcement Learning and weather forecast [J].
Artusi, Eva .
2021 22ND IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (MDM 2021), 2021, :258-260
[3]   Frequency domain identification of the Nomoto model to facilitate Kalman filter estimation and PID heading control of a patrol vessel [J].
Banazadeh, A. ;
Ghorbani, M. T. .
OCEAN ENGINEERING, 2013, 72 :344-355
[4]   Mission Integrated Collision Avoidance for USVs using Laser Range Finder [J].
Blaich, Michael ;
Koehler, Steffen ;
Schuster, Michael ;
Schuchhardt, Thomas ;
Reuter, Johannes ;
Tietz, Thomas .
OCEANS 2015 - GENOVA, 2015,
[5]  
Casalino G, 2009, OCEANS-IEEE, P1479
[6]   LifeGuard: An Improvement of Actor-Critic Model with Collision Predictor in Autonomous UAV Navigation [J].
Chansuparp, Manit ;
Jitkajornwanich, Kulsawasd .
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[7]  
Chen DJ, 2015, 3RD INTERNATIONAL CONFERENCE ON TRANSPORTATION INFORMATION AND SAFETY (ICTIS 2015), P512, DOI 10.1109/ICTIS.2015.7232141
[8]   Deep reinforcement learning based controller for ship navigation [J].
Deraj, Rohit ;
Kumar, R. S. Sanjeev ;
Alam, Md Shadab ;
Somayajula, Abhilash .
OCEAN ENGINEERING, 2023, 273
[9]   Double ant colony algorithm based on dynamic feedback for energy-saving route planning for ships [J].
Dong, Liangxiong ;
Li, Jun ;
Xia, Wei ;
Yuan, Qiang .
SOFT COMPUTING, 2021, 25 (07) :5021-5035
[10]   Ship Pipe Route Design Using Improved A* Algorithm and Genetic Algorithm [J].
Dong, Zongran ;
Bian, Xuanyi .
IEEE ACCESS, 2020, 8 :153273-153296