Underwater Target Tracking Based on Hierarchical Software-Defined Multi-AUV Reinforcement Learning: A Multi-AUV Advantage-Attention Actor-Critic Approach

被引:0
|
作者
Zhu, Shengchao [1 ]
Han, Guangjie [2 ]
Lin, Chuan [3 ,4 ]
Tao, Qiuzi [5 ]
机构
[1] Hohai Univ, Sch Comp & Informat, Nanjing 211100, Peoples R China
[2] Hohai Univ, Key Lab Maritime Intelligent Network Informat Tech, Minist Educ, Nanjing 211100, Peoples R China
[3] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[4] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[5] Dalian Univ Technol, Software Coll, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
AUV cluster network system; advantage attention; advantage resampling; software-defined network; tracking targets; INTERNET; NETWORK; MANAGEMENT; NAVIGATION; SCHEME; IOT;
D O I
10.1109/TMC.2024.3437376
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid development of underwater robots, underwater communication techniques, etc., the Autonomous Underwater Vehicle (AUV) cluster network has emerged as a candidate paradigm to perform underwater civil and military applications, e.g., underwater target tracking. In this paper, we focus on how to utilize networking and multi-agent artificial intelligence technique to improve underwater target tracking. In particular, to improve the flexibility and scalability of the AUV cluster network, we employ Software-Defined Networking (SDN) and Centralized Training with Decentralized Execution (CTDE)-based Multi-Agent Reinforcement Learning (MARL) technologies, to propose a Hierarchical Software-Defined Multiple AUVs Reinforcement Learning (HSD-MARL) framework. For the MARL mechanism in HSD-MARL, we propose an advantage-attention mechanism and present the architecture of Multi-AUV Advantage-Attention Actor-Critic (MA-A3C), to address slow convergence and poor scalability issues on the AUV cluster network of large-scale. Further, to improve the utilization rate of advantage samples especially when the MA-A3C is utilized to perform AUV cluster network-based underwater tracking, we propose an 'advantage resampling' method based on experience replay buffer. Evaluation results showcase that our proposed approaches can perform exact underwater target tracking based on AUV cluster network systems and outperform some recent research products in terms of convergence speed, tracking accuracy, etc.
引用
收藏
页码:13639 / 13653
页数:15
相关论文
共 28 条
  • [1] Underwater Target Tracking Based on Interrupted Software-Defined Multi-AUV Reinforcement Learning: A Multi-AUV Time-Saving MARL Approach
    Zhu, Shengchao
    Han, Guangjie
    Lin, Chuan
    Zhang, Yu
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 2124 - 2136
  • [2] Multi-AUV cooperative target search and tracking in unknown underwater environment
    Cao, Xiang
    Sun, Hongbing
    Jan, Gene Eu
    OCEAN ENGINEERING, 2018, 150 : 1 - 11
  • [3] Potential field hierarchical reinforcement learning approach for target search by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Sun, Hongbing
    Guo, Liqiang
    INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (07) : 1677 - 1683
  • [4] An Efficient Multi-AUV Cooperative Navigation Method Based on Hierarchical Reinforcement Learning
    Zhu, Zixiao
    Zhang, Lichuan
    Liu, Lu
    Wu, Dongwei
    Bai, Shuchang
    Ren, Ranzhen
    Geng, Wenlong
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (10)
  • [5] Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning
    Cai, Lei
    Sun, Qiankun
    Xu, Tao
    Ma, Yukun
    Chen, Zhenxue
    IEEE ACCESS, 2020, 8 : 39273 - 39284
  • [6] Multi-AUV Cooperative Underwater Multi-Target Tracking Based on Dynamic-Switching-Enabled Multi-Agent Reinforcement Learning
    Wang, Shengbo
    Lin, Chuan
    Han, Guangjie
    Zhu, Shengchao
    Li, Zhixian
    Wang, Zhenyu
    Ma, Yunpeng
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) : 4296 - 4311
  • [7] State Super Sampling Soft Actor-Critic Algorithm for Multi-AUV Hunting in 3D Underwater Environment
    Wang, Zhuo
    Sui, Yancheng
    Qin, Hongde
    Lu, Hao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
  • [8] A fuzzy-based potential field hierarchical reinforcement learning approach for target hunting by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Zuo, Fen
    INTERNATIONAL JOURNAL OF CONTROL, 2021, 94 (05) : 1334 - 1343
  • [9] Distributional Soft Actor-Critic-Based Multi-AUV Cooperative Pursuit for Maritime Security Protection
    Hou, Yun
    Han, Guangjie
    Zhang, Fan
    Lin, Chuan
    Peng, Jinlin
    Liu, Li
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (06) : 6049 - 6060
  • [10] Multi-AUV Cooperative Target Hunting based on Improved Potential Field in Underwater Environment
    Cao, Xiang
    Sun, Changyin
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 118 - 122