Dynamic Navigation and Area Assignment of Multiple USVs Based on Multi-Agent Deep Reinforcement Learning

被引：9

作者：

Wen, Jiayi ^{[1
]}

Liu, Shaoman ^{[1
]}

Lin, Yejin ^{[1
]}

机构：

[1] Dalian Maritime Univ, Lab Intelligent Marine Vehicles DMU, Dalian 116026, Peoples R China

来源：

SENSORS | 2022年 / 22卷 / 18期

基金：

中国国家自然科学基金;

关键词：

USV; trajectory design; policy gradient; multi-agent deep reinforcement learning; multi-object optimization; SURFACE; TRACKING; VEHICLE;

D O I：

10.3390/s22186942

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

The unmanned surface vehicle (USV) has attracted more and more attention because of its basic ability to perform complex maritime tasks autonomously in constrained environments. However, the level of autonomy of one single USV is still limited, especially when deployed in a dynamic environment to perform multiple tasks simultaneously. Thus, a multi-USV cooperative approach can be adopted to obtain the desired success rate in the presence of multi-mission objectives. In this paper, we propose a cooperative navigating approach by enabling multiple USVs to automatically avoid dynamic obstacles and allocate target areas. To be specific, we propose a multi-agent deep reinforcement learning (MADRL) approach, i.e., a multi-agent deep deterministic policy gradient (MADDPG), to maximize the autonomy level by jointly optimizing the trajectory of USVs, as well as obstacle avoidance and coordination, which is a complex optimization problem usually solved separately. In contrast to other works, we combined dynamic navigation and area assignment to design a task management system based on the MADDPG learning framework. Finally, the experiments were carried out on the Gym platform to verify the effectiveness of the proposed method.

引用

页数：14

共 39 条

[1] [Anonymous], 2013, Robot intelligence technology and applications 2012
[2] Using Reinforcement Learning to Minimize the Probability of Delay Occurrence in Transportation
Cao, Zhiguang
Guo, Hongliang
Song, Wen
Gao, Kaizhou
Chen, Zhenghua
Zhang, Le
Zhang, Xuexi
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) : 2424 - 2436
[3] Liquid State Machine Learning for Resource and Cache Management in LTE-U Unmanned Aerial Vehicle (UAV) Networks
Chen, Mingzhe
Saad, Walid
Yin, Changchuan
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (03) : 1504 - 1517
[4] Mixed-Integer Linear Programming for Optimal Scheduling of Autonomous Vehicle Intersection Crossing
Fayazi, Seyed Alireza
Vahidi, Ardalan
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2018, 3 (03): : 287 - 299
[5] Unmanned Surface Vehicle Collision Avoidance Path Planning in Restricted Waters Using Multi-Objective Optimisation Complying with COLREGs
Gu, Yang
Rong, Zhenwei
Tong, Huzhou
Wang, Jia
Si, Yulin
Yang, Shujie
[J]. SENSORS, 2022, 22 (15)
[6] Global path planning and multi-objective path control for unmanned surface vehicle based on modified particle swarm optimization (PSO) algorithm
Guo, Xinghai
Ji, Mingjun
Zhao, Ziwei
Wen, Dusu
Zhang, Weidan
[J]. OCEAN ENGINEERING, 2020, 216
[7] Karaman S, 2011, IEEE INT CONF ROBOT, P1478
[8] REAL-TIME OBSTACLE AVOIDANCE FOR MANIPULATORS AND MOBILE ROBOTS
KHATIB, O
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1986, 5 (01) : 90 - 98
[9] Real-Time Motion Planning With Applications to Autonomous Urban Driving
Kuwata, Yoshiaki
Teo, Justin
Fiore, Gaston
Karaman, Sertac
Frazzoli, Emilio
How, Jonathan P.
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2009, 17 (05) : 1105 - 1118
[10] Littman ML, 1994, P 11 INT C MACH LEAR, P157, DOI DOI 10.1016/B978-1-55860-335-6.50027-1

← 1 2 3 4 →