Underwater Target Tracking Based on Hierarchical Software-Defined Multi-AUV Reinforcement Learning: A Multi-AUV Advantage-Attention Actor-Critic Approach

Cited by: 0

Authors:

Zhu, Shengchao [1]

Han, Guangjie [2]

Lin, Chuan [3,4]

Tao, Qiuzi [5]

Affiliations:

[1] Hohai Univ, Sch Comp & Informat, Nanjing 211100, Peoples R China

[2] Hohai Univ, Key Lab Maritime Intelligent Network Informat Tech, Minist Educ, Nanjing 211100, Peoples R China

[3] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China

[4] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China

[5] Dalian Univ Technol, Software Coll, Dalian 116024, Peoples R China

Source:

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024 / Vol. 23 / No. 12

Funding:

National Natural Science Foundation of China;

Keywords:

AUV cluster network system; advantage attention; advantage resampling; software-defined network; tracking targets; INTERNET; NETWORK; MANAGEMENT; NAVIGATION; SCHEME; IOT;

DOI:

10.1109/TMC.2024.3437376

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

With the rapid development of underwater robots, underwater communication techniques, and related technologies, the Autonomous Underwater Vehicle (AUV) cluster network has emerged as a candidate paradigm for underwater civil and military applications, e.g., underwater target tracking. In this paper, we focus on how to use networking and multi-agent artificial intelligence techniques to improve underwater target tracking. In particular, to improve the flexibility and scalability of the AUV cluster network, we employ Software-Defined Networking (SDN) and Centralized Training with Decentralized Execution (CTDE)-based Multi-Agent Reinforcement Learning (MARL) to propose a Hierarchical Software-Defined Multiple AUVs Reinforcement Learning (HSD-MARL) framework. For the MARL mechanism in HSD-MARL, we propose an advantage-attention mechanism and present the architecture of Multi-AUV Advantage-Attention Actor-Critic (MA-A3C) to address the slow convergence and poor scalability of large-scale AUV cluster networks. Further, to improve the utilization rate of advantage samples, especially when MA-A3C is used for AUV cluster network-based underwater tracking, we propose an 'advantage resampling' method based on the experience replay buffer. Evaluation results show that the proposed approaches can perform accurate underwater target tracking on AUV cluster network systems and outperform several recent approaches in terms of convergence speed, tracking accuracy, etc.

Pages: 13639-13653

Page count: 15

28 items in total

[1] Underwater Target Tracking Based on Interrupted Software-Defined Multi-AUV Reinforcement Learning: A Multi-AUV Time-Saving MARL Approach
Zhu, Shengchao
Han, Guangjie
Lin, Chuan
Zhang, Yu
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 2124 - 2136
[2] Multi-AUV cooperative target search and tracking in unknown underwater environment
Cao, Xiang
Sun, Hongbing
Jan, Gene Eu
OCEAN ENGINEERING, 2018, 150 : 1 - 11
[3] Potential field hierarchical reinforcement learning approach for target search by multi-AUV in 3-D underwater environments
Cao, Xiang
Sun, Hongbing
Guo, Liqiang
INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (07) : 1677 - 1683
[4] An Efficient Multi-AUV Cooperative Navigation Method Based on Hierarchical Reinforcement Learning
Zhu, Zixiao
Zhang, Lichuan
Liu, Lu
Wu, Dongwei
Bai, Shuchang
Ren, Ranzhen
Geng, Wenlong
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (10)
[5] Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
Cai, Lei
Sun, Qiankun
Xu, Tao
Ma, Yukun
Chen, Zhenxue
IEEE ACCESS, 2020, 8 : 39273 - 39284
[6] Multi-AUV Cooperative Underwater Multi-Target Tracking Based on Dynamic-Switching-Enabled Multi-Agent Reinforcement Learning
Wang, Shengbo
Lin, Chuan
Han, Guangjie
Zhu, Shengchao
Li, Zhixian
Wang, Zhenyu
Ma, Yunpeng
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 4296 - 4311
[7] State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment
Wang, Zhuo
Sui, Yancheng
Qin, Hongde
Lu, Hao
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
[8] A fuzzy-based potential field hierarchical reinforcement learning approach for target hunting by multi-AUV in 3-D underwater environments
Cao, Xiang
Zuo, Fen
INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1334 - 1343
[9] Distributional Soft Actor-Critic-Based Multi-AUV Cooperative Pursuit for Maritime Security Protection
Hou, Yun
Han, Guangjie
Zhang, Fan
Lin, Chuan
Peng, Jinlin
Liu, Li
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 6049 - 6060
[10] Multi-AUV Cooperative Target Hunting based on Improved Potential Field in Underwater Environment
Cao, Xiang
Sun, Changyin
PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 118 - 122