Resource Allocation for Multi-Target Radar Tracking via Constrained Deep Reinforcement Learning

被引：5

作者：

Lu, Ziyang ^{[1
]}

Gursoy, M. Cenk ^{[1
]}

机构：

[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA

来源：

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING | 2023年 / 9卷 / 06期

基金：

美国国家科学基金会;

关键词：

Constrained optimization; extended Kalman filter; multi-target tracking; radar; reinforcement learning; resource allocation; COGNITIVE RADAR;

D O I：

10.1109/TCCN.2023.3304634

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

In this paper, multi-target tracking in a radar system is considered, and adaptive radar resource management is addressed. In particular, time management in tracking multiple maneuvering targets subject to budget constraints is studied with the goal to minimize the total tracking cost of all targets (or equivalently to maximize the tracking accuracies). The constrained optimization of the dwell time allocation to each target is addressed via deep Q-network (DQN) based reinforcement learning. In the proposed constrained deep reinforcement learning (CDRL) algorithm, both the parameters of the DQN and the dual variable are learned simultaneously. The proposed CDRL framework consists of two components, namely online CDRL and offline CDRL. Training a DQN in the deep reinforcement learning algorithm usually requires a large amount of data, which may not be available in a target tracking task due to the scarcity of measurements. We address this challenge by proposing an offline CDRL framework, in which the algorithm evolves in a virtual environment generated based on the current observations and prior knowledge of the environment. Simulation results show that both offline CDRL and online CDRL are critical for effective target tracking and resource utilization. Offline CDRL provides more training data to stabilize the learning process and the online component can sense the change in the environment and make the corresponding adaptation. Furthermore, a hybrid CDRL algorithm that combines offline CDRL and online CDRL is proposed to reduce the computational burden by performing offline CDRL only periodically to stabilize the training process of the online CDRL.

引用

页码：1677 / 1690

页数：14

共 50 条

[21] Deep learning algorithm based on MobileNet for multi-target tracking
Xue J.-T.
Ma R.-H.
Hu C.-F.
Kongzhi yu Juece/Control and Decision, 2021, 36 (08): : 1991 - 1996
[22] Radar Resource Management for Multi-Target Tracking Using Model Predictive Control
de Boer, Thies
Schope, Max Ian
Driessen, Hans
2021 IEEE 24TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2021, : 270 - 277
[23] User and resource allocation in latency constrained Xhaul via reinforcement learning
Chughtai, Mohsan Niaz
Noor, Shabnam
Laurinavicius, Ignas
Assimakopoulos, Philippos
Gomes, Nathan J.
Zhu, Huiling
Wang, Jiangzhou
Zheng, Xi
Yan, Qi
JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2023, 15 (04) : 219 - 228
[24] Memory-based deep reinforcement learning for cognitive radar target tracking waveform resource management
Qin, Jiahao
Zhu, Mengtao
Pan, Zesi
Li, Yunjie
Li, Yan
IET RADAR SONAR AND NAVIGATION, 2023, 17 (12): : 1822 - 1836
[25] Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs*
Zhang, Tianle
Liu, Zhen
Pu, Zhiqiang
Yi, Jianqiang
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8794 - 8800
[26] Radar pulse interleaving for multi-target tracking
Elshafei, M
Sherali, HD
Smith, JC
NAVAL RESEARCH LOGISTICS, 2004, 51 (01) : 72 - 94
[27] Cooperative Multi-Target Tracking with MIMO Radar
Zhang, Liang
Chen, Dong
Yang, Fan
Tao, HaiJun
Chen, WeiGuo
2017 16TH INTERNATIONAL CONFERENCE ON OPTICAL COMMUNICATIONS & NETWORKS (ICOCN 2017), 2017,
[28] Joint power and time width allocation in collocated MIMO radar for multi-target tracking
Li, Zhengjie
Xie, Junwei
Zhang, Haowei
IET RADAR SONAR AND NAVIGATION, 2020, 14 (05): : 686 - 693
[29] Inverse Reinforcement Learning for Generalized Labeled Multi-Bernoulli Multi-Target Tracking
Thomas, Ryan W.
Larson, Jordan D.
2021 IEEE AEROSPACE CONFERENCE (AEROCONF 2021), 2021,
[30] Adaptive Resource Management in Multi-target Tracking for Collocated MIMO Radar with Simultaneous Multibeam
Wei, Xuejiao
Cheng, Ting
Peng, Han
2019 22ND INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2019), 2019,

← 1 2 3 4 5 →