Decision-Making for the Autonomous Navigation of Maritime Autonomous Surface Ships Based on Scene Division and Deep Reinforcement Learning

被引：81

作者：

Zhang, Xinyu ^{[1
]}

Wang, Chengbo ^{[1
,2
]}

Liu, Yuanchang ^{[3
]}

Chen, Xiang ^{[4
]}

机构：

[1] Dalian Maritime Univ, Minist Transportat, Key Lab Maritime Dynam Simulat & Control, Dalian 116026, Peoples R China

[2] Dalian Maritime Univ, Marine Engn Coll, Dalian 116026, Peoples R China

[3] UCL, Dept Mech Engn, Torrington Pl, London WC1E 7JE, England

[4] Dept Civil Environm & Geomat Engn, London WC1E 6BT, England

来源：

SENSORS | 2019年 / 19卷 / 18期

关键词：

decision-making; autonomous navigation; collision avoidance; scene division; deep reinforcement learning; maritime autonomous surface ships;

D O I：

10.3390/s19184055

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

This research focuses on the adaptive navigation of maritime autonomous surface ships (MASSs) in an uncertain environment. To achieve intelligent obstacle avoidance of MASSs in a port, an autonomous navigation decision-making model based on hierarchical deep reinforcement learning is proposed. The model is mainly composed of two layers: the scene division layer and an autonomous navigation decision-making layer. The scene division layer mainly quantifies the sub-scenarios according to the International Regulations for Preventing Collisions at Sea (COLREG). This research divides the navigational situation of a ship into entities and attributes based on the ontology model and Protege language. In the decision-making layer, we designed a deep Q-learning algorithm utilizing the environmental model, ship motion space, reward function, and search strategy to learn the environmental state in a quantized sub-scenario to train the navigation strategy. Finally, two sets of verification experiments of the deep reinforcement learning (DRL) and improved DRL algorithms were designed with Rizhao port as a study case. Moreover, the experimental data were analyzed in terms of the convergence trend, iterative path, and collision avoidance effect. The results indicate that the improved DRL algorithm could effectively improve the navigation safety and collision avoidance.

引用

页数：18

共 22 条

[1] A study on the collision avoidance of a ship using neural networks and fuzzy logic [J].

Ahn, Jin-Hyeong ;

Rhee, Key-Pyo ;

You, Young-Jun .

APPLIED OCEAN RESEARCH, 2012, 37 :162-173

[2]

[Anonymous], 2016, ARXIV PREPRINT ARXIV

[3]

[Anonymous], 2016, thesis

[4] Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vessels [J].

Cheng, Yin ;

Zhang, Weidong .

NEUROCOMPUTING, 2018, 272 :63-73

[5]

EMSA, 2018, ANN OV MAR CAS INC

[6] An autonomous dynamic collision avoidance control method for unmanned surface vehicle in unknown ocean environment [J].

Fan, Yunsheng ;

Sun, Xiaojie ;

Wang, Guofeng .

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (02)

[7]

Geng X., 2017, THESIS

[8] Motion Plan of Maritime Autonomous Surface Ships by Dynamic Programming for Collision Avoidance and Speed Optimization [J].

Geng, Xiongfei ;

Wang, Yongcai ;

Wang, Ping ;

Zhang, Baochen .

SENSORS, 2019, 19 (02)

[9] Deep-Learned Collision Avoidance Policy for Distributed Multiagent Navigation [J].

Long, Pinxin ;

Liu, Wenxi ;

Pan, Jia .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02) :656-663

[10] Grid Path Planning with Deep Reinforcement Learning: Preliminary Results [J].

Panov, Aleksandr, I ;

Yakovlev, Konstantin S. ;

Suvorov, Roman .

8TH ANNUAL INTERNATIONAL CONFERENCE ON BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, BICA 2017 (EIGHTH ANNUAL MEETING OF THE BICA SOCIETY), 2018, 123 :347-353

← 1 2 3 →