Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning

被引:31
作者
Heiberg, Amalie [1 ]
Larsen, Thomas Nakken [2 ]
Meyer, Eivind [3 ]
Rasheed, Adil [2 ]
San, Omer [4 ]
Varagnolo, Damiano [2 ]
机构
[1] Equinor, Stavanger, Norway
[2] Norwegian Univ Sci & Technol, Dept Engn Cybernet, Trondheim, Norway
[3] Tech Univ Munich, Inst Informat, Munich, Germany
[4] Oklahoma State Univ, Sch Mech & Aerosp Engn, Stillwater, OK USA
关键词
Deep reinforcement learning; Collision avoidance; Path following; Collision risk indices; Machine learning controller; Autonomous surface vehicle; SHIP COLLISION-AVOIDANCE; NAVIGATION; DOMAINS;
D O I
10.1016/j.neunet.2022.04.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Autonomous systems are becoming ubiquitous and gaining momentum within the marine sector. Since the electrification of transport is happening simultaneously, autonomous marine vessels can reduce environmental impact, lower costs, and increase efficiency. Although close monitoring is still required to ensure safety, the ultimate goal is full autonomy. One major milestone is to develop a control system that is versatile enough to handle any weather and encounter that is also robust and reliable. Additionally, the control system must adhere to the International Regulations for Preventing Collisions at Sea (COLREGs) for successful interaction with human sailors. Since the COLREGs were written for the human mind to interpret, they are written in ambiguous prose and therefore not machine-readable or verifiable. Due to these challenges and the wide variety of situations to be tackled, classical model-based approaches prove complicated to implement and computationally heavy. Within machine learning (ML), deep reinforcement learning (DRL) has shown great potential for a wide range of applications. The model-free and self-learning properties of DRL make it a promising candidate for autonomous vessels. In this work, a subset of the COLREGs is incorporated into a DRL-based path following and obstacle avoidance system using collision risk theory. The resulting autonomous agent dynamically interpolates between path following and COLREG-compliant collision avoidance in the training scenario, isolated encounter situations, and AIS-based simulations of real-world scenarios. (C) 2022 The Author(s). Published by Elsevier Ltd.
引用
收藏
页码:17 / 33
页数:17
相关论文
共 69 条
[1]   Towards the Verification of Safety-critical Autonomous Systems in Dynamic Environments [J].
Aniculaesei, Adina ;
Arnsberger, Daniel ;
Howar, Falk ;
Rausch, Andreas .
ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2016, (232) :79-90
[2]  
[Anonymous], 2004, PROC IFAC C CONTROL
[3]  
[Anonymous], 2008, Collision avoidance for unmanned surface vehicles
[4]   Evolutionary trajectory planning of ships in navigation traffic areas [J].
Smierzchalski R. .
Journal of Marine Science and Technology, 1999, 4 (1) :1-6
[5]  
[Anonymous], 2016, ARXIV161207139
[6]  
AUTOSHIP, 2020, Autonomous Shipping Initiative for European Waters
[7]   DOA tracking for seamless connectivity in beamformed IoT-based drones [J].
Balamurugan, N. M. ;
Mohan, Senthilkumar ;
Adimoolam, M. ;
John, A. ;
reddy, Thippa G. ;
Wang, Weizheng .
COMPUTER STANDARDS & INTERFACES, 2022, 79
[8]  
Benjamin M. R., 2006, J FIELD ROBOT, V29, P554, DOI [10.1002/rob, DOI 10.1002/ROB]
[9]   Review of Health Prognostics and Condition Monitoring of Electronic Components [J].
Bhargava, Cherry ;
Sharma, Pardeep Kumar ;
Senthilkumar, Mohan ;
Padmanaban, Sanjeevikumar ;
Ramachandaramurthy, Vigna K. ;
Leonowicz, Zbigniew ;
Blaabjerg, Frede ;
Mitolo, Massimo .
IEEE ACCESS, 2020, 8 :75163-75183
[10]  
Casalino G, 2009, OCEANS-IEEE, P1479