Resource Allocation for Multi-Target Radar Tracking via Constrained Deep Reinforcement Learning

Cited by: 5
Authors
Lu, Ziyang [1]
Gursoy, M. Cenk [1]
Affiliations
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA
Funding
U.S. National Science Foundation
Keywords
Constrained optimization; extended Kalman filter; multi-target tracking; radar; reinforcement learning; resource allocation; cognitive radar
DOI
10.1109/TCCN.2023.3304634
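Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Subject classification code
0809
Abstract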
This paper considers multi-target tracking in a radar system and addresses adaptive radar resource management. In particular, it studies time management for tracking multiple maneuvering targets under budget constraints, with the goal of minimizing the total tracking cost over all targets (or, equivalently, maximizing tracking accuracy). The constrained optimization of the dwell time allocated to each target is addressed via deep Q-network (DQN)-based reinforcement learning. In the proposed constrained deep reinforcement learning (CDRL) algorithm, the parameters of the DQN and the dual variable are learned simultaneously. The proposed CDRL framework consists of two components: online CDRL and offline CDRL. Training a DQN usually requires a large amount of data, which may not be available in a target tracking task because measurements are scarce. The offline CDRL framework addresses this challenge by letting the algorithm evolve in a virtual environment generated from the current observations and prior knowledge of the environment. Simulation results show that both offline CDRL and online CDRL are critical for effective target tracking and resource utilization: offline CDRL provides more training data to stabilize the learning process, while the online component senses changes in the environment and adapts accordingly. Furthermore, a hybrid CDRL algorithm that combines offline and online CDRL is proposed; it reduces the computational burden by performing offline CDRL only periodically to stabilize the training of the online CDRL.
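The idea of learning the value function and the dual variable at the same time can be illustrated with a small primal-dual sketch. This is not the authors' implementation: a tabular Q-table stands in for the DQN, the extended Kalman filter is replaced by a toy "uncertainty level" state, and every name and number (`DWELL_OPTIONS`, `GAIN`, `BUDGET`, the cost model, the step sizes) is a hypothetical placeholder chosen only for illustration.

```python
import random

# All values below are hypothetical placeholders, not taken from the paper.
DWELL_OPTIONS = [0.1, 0.2, 0.4, 0.8]   # candidate dwell times per step
GAIN = [0, 1, 2, 3]                     # uncertainty levels removed by each option
N_LEVELS = 5                            # discretized track-uncertainty levels
BUDGET = 0.3                            # allowed average dwell time per step


def step(u, a):
    """Toy environment: uncertainty grows by one level, dwell shrinks it."""
    u_next = min(max(u + 1 - GAIN[a], 0), N_LEVELS - 1)
    cost = float(u_next)                # tracking cost = resulting uncertainty
    return u_next, cost, DWELL_OPTIONS[a]


def train(steps=20000, alpha=0.1, gamma=0.9, eps=0.1, eta=0.01, seed=0):
    """Run tabular Q-learning with a simultaneously updated dual variable."""
    rng = random.Random(seed)
    q = [[0.0] * len(DWELL_OPTIONS) for _ in range(N_LEVELS)]
    lam = 0.0                           # dual variable for the budget constraint
    u = 0
    for _ in range(steps):
        # epsilon-greedy action selection on the current Q-table
        if rng.random() < eps:
            a = rng.randrange(len(DWELL_OPTIONS))
        else:
            a = max(range(len(DWELL_OPTIONS)), key=lambda i: q[u][i])
        u_next, cost, dwell = step(u, a)
        # Lagrangian reward: negative cost minus the priced budget violation
        r = -(cost + lam * (dwell - BUDGET))
        # primal step: one Q-learning update
        target = r + gamma * max(q[u_next])
        q[u][a] += alpha * (target - q[u][a])
        # dual step: projected ascent on the constraint violation
        lam = max(0.0, lam + eta * (dwell - BUDGET))
        u = u_next
    return q, lam
```

Calling `q, lam = train()` returns the learned table and the final dual variable. The dual variable rises while the chosen dwell times exceed the budget and falls (never below zero) while they stay under it, which is the general mechanism by which a Lagrangian method prices a resource constraint.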
Pages: 1677-1690
Page count: 14