Collision avoidance for an unmanned surface vehicle using deep reinforcement learning

被引:179
作者
Woo, Joohyun [1 ]
Kim, Nakwan [2 ]
机构
[1] Seoul Natl Univ, Inst Engn Res, 1 Gwanak Ro, Seoul 08826, South Korea
[2] Seoul Natl Univ, Res Inst Marine Syst Engn, 1 Gwanak Ro, Seoul 08826, South Korea
关键词
Deep reinforcement learning; Collision avoidance; Unmanned surface vehicle; COLREGs; Artificial intelligence;
D O I
10.1016/j.oceaneng.2020.107001
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In this paper, a deep reinforcement learning (DRL)-based collision avoidance method is proposed for an unmanned surface vehicle (USV). This approach is applicable to the decision-making stage of collision avoidance, which determines whether the avoidance is necessary, and if so, determines the direction of the avoidance maneuver. To utilize the visual recognition capability of deep neural networks as a tool for analyzing the complex and ambiguous situations that are typically encountered, a grid map representation of the ship encounter situation was suggested. For the composition of the DRL network, we proposed a neural network architecture and semi-Markov decision process model that was specially designed for the USV collision avoidance problem. The proposed DRL network was trained through repeated simulations of collision avoidance. After the training process, the DRL network was implemented in collision avoidance experiments and simulations to evaluate its situation recognition and collision avoidance capability.
引用
收藏
页数:16
相关论文
共 29 条
[21]  
Polvara R, 2018, INT CONF UNMAN AIRCR, P115, DOI 10.1109/ICUAS.2018.8453449
[22]  
Shixiang Gu, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3389, DOI 10.1109/ICRA.2017.7989385
[23]  
Simonyan K., 2014, WORKSHOP INT C LEARN
[24]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[25]   Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning [J].
Sutton, RS ;
Precup, D ;
Singh, S .
ARTIFICIAL INTELLIGENCE, 1999, 112 (1-2) :181-211
[26]  
Tai L, 2017, IEEE INT C INT ROBOT, P31
[27]   Dynamic model identification of unmanned surface vehicles using deep learning network [J].
Woo, Joohyun ;
Park, Jongyoung ;
Yu, Chanwoo ;
Kim, Nakwan .
APPLIED OCEAN RESEARCH, 2018, 78 :123-133
[28]   Vision-based obstacle collision risk estimation of an unmanned surface vehicle [J].
Woo, Joohyun ;
Kim, Nakwan .
Journal of Institute of Control, Robotics and Systems, 2015, 21 (12) :1089-1099
[29]  
Zhu YK, 2017, INT CONF ACOUST SPEE, P5335, DOI 10.1109/ICASSP.2017.7953175