Controlling a cargo ship without human experience using deep Q-network

被引:9
作者
Chen, Chen [1 ]
Ma, Feng [2 ,3 ]
Liu, Jialun [2 ,3 ]
Negenborn, Rudy R. [3 ,4 ]
Liu, Yuanchang [5 ]
Yan, Xinping [2 ,3 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Wuhan Univ Technol, Intelligent Transportat Syst Ctr, Wuhan 430063, Peoples R China
[3] Natl Engn Res Ctr Water Transport Safety, Wuhan, Peoples R China
[4] Delft Univ Technol, Dept Maritime & Transport Technol, Delft, Netherlands
[5] UCL, Dept Mech Engn, Torrington Pl, London, England
基金
国家重点研发计划;
关键词
Deep Q-network; reinforcement learning; artificial intelligence; autonomous ships; UNMANNED SURFACE VEHICLE; COLLISION-AVOIDANCE; OBSTACLE AVOIDANCE; CONTROL-SYSTEM; LEVEL CONTROL; MODEL; NAVIGATION; DESIGN;
D O I
10.3233/JIFS-200754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human experience is regarded as an indispensable part of artificial intelligence in the process of controlling or decision making for autonomous cargo ships. In this paper, a novel Deep Q-Network-based (DQN) approach is proposed, which performs satisfactorily in controlling a cargo ship automatically without any human experience. At the very beginning, we use the model of KRISO Very Large Crude Carrier (KVLCC2) to describe a cargo ship. To manipulate this ship has to conquer great inertia and relatively insufficient driving force. Subsequently, customary waterways, regulations, conventions are described with Artificial Potential Field and value-functions in DQN. Based on this, the artificial intelligence of planning and controlling a cargo ship can be obtained by undertaking sufficient training, which can control the ship directly, while avoiding collisions, keeping its position in the middle of the route as much as possible. In simulation experiments, it is demonstrated that such an approach performs better than manual works and other traditional methods in most conditions, which makes the proposed method a promising solution in improving the autonomy level of cargo ships.
引用
收藏
页码:7363 / 7379
页数:17
相关论文
共 43 条
[1]  
[Anonymous], 2018, ADV NEURAL INFORM PR
[2]  
[Anonymous], 2011, HDB MARINE CRAFT HYD
[3]  
[Anonymous], 2006, P 14 MED C CONTR AUT
[4]  
[Anonymous], 2016, P NIPS
[5]   Vector-based navigation using grid-like representations in artificial agents [J].
Banino, Andrea ;
Barry, Caswell ;
Uria, Benigno ;
Blundell, Charles ;
Lillicrap, Timothy ;
Mirowski, Piotr ;
Pritzel, Alexander ;
Chadwick, Martin J. ;
Degris, Thomas ;
Modayil, Joseph ;
Wayne, Greg ;
Soyer, Hubert ;
Viola, Fabio ;
Zhang, Brian ;
Goroshin, Ross ;
Rabinowitz, Neil ;
Pascanu, Razvan ;
Beattie, Charlie ;
Petersen, Stig ;
Sadik, Amir ;
Gaffney, Stephen ;
King, Helen ;
Kavukcuoglu, Koray ;
Hassabis, Demis ;
Hadsell, Raia ;
Kumaran, Dharshan .
NATURE, 2018, 557 (7705) :429-+
[6]  
Bellemare M. G., 2012, INVESTIGATING CONTIN
[7]   RL-based path planning for an over-actuated floating vehicle under disturbances [J].
Blekas, Konstantinos ;
Vlachos, Kostas .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 101 :93-102
[8]   A review on improving the autonomy of unmanned surface vehicles through intelligent collision avoidance manoeuvres [J].
Campbell, S. ;
Naeem, W. ;
Irwin, G. W. .
ANNUAL REVIEWS IN CONTROL, 2012, 36 (02) :267-283
[9]  
Campbell S., 2012, IFAC Proc., V45, P386, DOI [10.3182/20120919-3-IT-2046.00066, DOI 10.3182/20120919-3-IT-2046.00066]
[10]   Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning [J].
Carlucho, Ignacio ;
De Paula, Mariano ;
Wang, Sen ;
Petillot, Yvan ;
Acosta, Gerardo G. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 :71-86