Collision avoidance for an unmanned surface vehicle using deep reinforcement learning

被引：179

作者：

Woo, Joohyun ^{[1
]}

Kim, Nakwan ^{[2
]}

机构：

[1] Seoul Natl Univ, Inst Engn Res, 1 Gwanak Ro, Seoul 08826, South Korea

[2] Seoul Natl Univ, Res Inst Marine Syst Engn, 1 Gwanak Ro, Seoul 08826, South Korea

来源：

OCEAN ENGINEERING | 2020年 / 199卷

关键词：

Deep reinforcement learning; Collision avoidance; Unmanned surface vehicle; COLREGs; Artificial intelligence;

D O I：

10.1016/j.oceaneng.2020.107001

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

In this paper, a deep reinforcement learning (DRL)-based collision avoidance method is proposed for an unmanned surface vehicle (USV). This approach is applicable to the decision-making stage of collision avoidance, which determines whether the avoidance is necessary, and if so, determines the direction of the avoidance maneuver. To utilize the visual recognition capability of deep neural networks as a tool for analyzing the complex and ambiguous situations that are typically encountered, a grid map representation of the ship encounter situation was suggested. For the composition of the DRL network, we proposed a neural network architecture and semi-Markov decision process model that was specially designed for the USV collision avoidance problem. The proposed DRL network was trained through repeated simulations of collision avoidance. After the training process, the DRL network was implemented in collision avoidance experiments and simulations to evaluate its situation recognition and collision avoidance capability.

引用

页数：16

共 29 条

[21]

Polvara R, 2018, INT CONF UNMAN AIRCR, P115, DOI 10.1109/ICUAS.2018.8453449

[22]

Shixiang Gu, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3389, DOI 10.1109/ICRA.2017.7989385

[23]

Simonyan K., 2014, WORKSHOP INT C LEARN

[24]

Sutton RS, 2018, ADAPT COMPUT MACH LE, P1

[25] Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning [J].

Sutton, RS ;

Precup, D ;

Singh, S .

ARTIFICIAL INTELLIGENCE, 1999, 112 (1-2) :181-211

[26]

Tai L, 2017, IEEE INT C INT ROBOT, P31

[27] Dynamic model identification of unmanned surface vehicles using deep learning network [J].

Woo, Joohyun ;

Park, Jongyoung ;

Yu, Chanwoo ;

Kim, Nakwan .

APPLIED OCEAN RESEARCH, 2018, 78 :123-133

[28] Vision-based obstacle collision risk estimation of an unmanned surface vehicle [J].

Woo, Joohyun ;

Kim, Nakwan .

Journal of Institute of Control, Robotics and Systems, 2015, 21 (12) :1089-1099

[29]

Zhu YK, 2017, INT CONF ACOUST SPEE, P5335, DOI 10.1109/ICASSP.2017.7953175

← 1 2 3 →