Underwater Target Tracking Based on Hierarchical Software-Defined Multi-AUV Reinforcement Learning: A Multi-AUV Advantage-Attention Actor-Critic Approach
Authors: Zhu, Shengchao [1]; Han, Guangjie [2]; Lin, Chuan [3,4]; Tao, Qiuzi [5]
Affiliations:
[1] Hohai Univ, Sch Comp & Informat, Nanjing 211100, Peoples R China
[2] Hohai Univ, Key Lab Maritime Intelligent Network Informat Tech, Minist Educ, Nanjing 211100, Peoples R China
[3] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[4] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[5] Dalian Univ Technol, Software Coll, Dalian 116024, Peoples R China
With the rapid development of underwater robots and underwater communication techniques, the Autonomous Underwater Vehicle (AUV) cluster network has emerged as a candidate paradigm for civil and military underwater applications such as underwater target tracking. In this paper, we focus on how networking and multi-agent artificial intelligence techniques can improve underwater target tracking. In particular, to improve the flexibility and scalability of the AUV cluster network, we combine Software-Defined Networking (SDN) with Multi-Agent Reinforcement Learning (MARL) based on Centralized Training with Decentralized Execution (CTDE), and propose a Hierarchical Software-Defined Multi-AUV Reinforcement Learning (HSD-MARL) framework. For the MARL mechanism in HSD-MARL, we propose an advantage-attention mechanism and present the Multi-AUV Advantage-Attention Actor-Critic (MA-A3C) architecture to address the slow convergence and poor scalability of large-scale AUV cluster networks. Further, to improve the utilization of advantage samples, especially when MA-A3C is used for AUV cluster network-based underwater tracking, we propose an "advantage resampling" method based on the experience replay buffer. Evaluation results show that the proposed approaches achieve accurate underwater target tracking on AUV cluster network systems and outperform some recent methods in terms of convergence speed, tracking accuracy, etc.
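The abstract does not spell out how "advantage resampling" works, so the following is only a minimal sketch of one plausible reading: transitions in a replay buffer are redrawn with probability proportional to their (clipped) advantage estimate. The class name `AdvantageReplayBuffer`, the weighting rule `max(advantage, 0) + epsilon`, and all parameter values are assumptions made for illustration, not the authors' method.

```python
import random
from collections import deque


class AdvantageReplayBuffer:
    """Toy replay buffer that favors transitions with a high advantage estimate.

    Hypothetical illustration only; the weighting rule is an assumption.
    """

    def __init__(self, capacity, epsilon=1e-3, seed=None):
        self.buffer = deque(maxlen=capacity)  # oldest transitions are evicted first
        self.epsilon = epsilon                # floor so every transition keeps a nonzero weight
        self.rng = random.Random(seed)

    def add(self, transition, advantage):
        # Store the transition together with its advantage estimate.
        self.buffer.append((transition, advantage))

    def weights(self):
        # Assumed rule: weight = max(advantage, 0) + epsilon.
        return [max(adv, 0.0) + self.epsilon for _, adv in self.buffer]

    def sample(self, batch_size):
        # Draw with replacement, proportionally to the weights above.
        picked = self.rng.choices(list(self.buffer), weights=self.weights(), k=batch_size)
        return [transition for transition, _ in picked]


buf = AdvantageReplayBuffer(capacity=4, seed=0)
for name, adv in [("t0", -1.0), ("t1", 0.5), ("t2", 2.0), ("t3", 0.0)]:
    buf.add(name, adv)
batch = buf.sample(1000)
```

Under this assumed rule, the transition with the largest advantage (`"t2"`) dominates the resampled batch, while transitions with a non-positive advantage are drawn only rarely because of the `epsilon` floor.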
Authors: Zhu, Zixiao [1,2]; Zhang, Lichuan [1,2]; Liu, Lu [1,2]; Wu, Dongwei [3]; Bai, Shuchang [1,2]; Ren, Ranzhen [1,2]; Geng, Wenlong [1,2]
Affiliations:
[1] Northwestern Polytech Univ, Res & Dev Inst, Shenzhen 518057, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
[3] Shanghai Suixun Elect Technol Co Ltd, Shanghai 200438, Peoples R China
Authors: Hou, Yun [1]; Han, Guangjie [2]; Zhang, Fan [1]; Lin, Chuan [3,4]; Peng, Jinlin [5]; Liu, Li [6]
Affiliations:
[1] Hohai Univ, Changzhou Key Lab Internet Things Technol Intellig, Changzhou 213022, Peoples R China
[2] Hohai Univ, Dept Internet Things Engn, Changzhou 213022, Peoples R China
[3] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[4] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[5] Natl Innovat Inst Def Technol, Artificial Intelligence Res Ctr, Beijing 100081, Peoples R China
[6] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214126, Peoples R China