A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame

被引：5

作者：

Chen, Li ^{[1
,2
]}

Zhang, Yulong ^{[2
,3
]}

Feng, Yanghe ^{[2
]}

Zhang, Longfei ^{[2
]}

Liu, Zhong ^{[2
]}

机构：

[1] Army Logist Acad, Chongqing 400000, Peoples R China

[2] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China

[3] 31002unit, Beijing 100000, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Target recognition; Man-machine systems; Data models; Costs; Radar imaging; Predictive models; Active reinforcement learning; human experience guidance; human-machine agent; machine data learning; target classification; SYSTEMS; NETWORK;

D O I：

10.1109/TNNLS.2023.3236944

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).

引用

页码：9858 / 9870

页数：13

共 43 条

[1] Active Learning for Estimating Reachable Sets for Systems With Unknown Dynamics [J].

Chakrabarty, Ankush ;

Danielson, Claus ;

Di Cairano, Stefano ;

Raghunathan, Arvind .

IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (04) :2531-2542

[2] Generative Deep Neural Networks for Inverse Materials Design Using Backpropagation and Active Learning [J].

Chen, Chun-Teh ;

Gu, Grace X. .

ADVANCED SCIENCE, 2020, 7 (05)

[3] Active one-shot learning by a deep Q-network strategy [J].

Chen, Li ;

Huang, Honglan ;

Feng, Yanghe ;

Cheng, Guangquan ;

Huang, Jincai ;

Liu, Zhong .

NEUROCOMPUTING, 2020, 383 :324-335

[4] Target Classification Using the Deep Convolutional Networks for SAR Images [J].

Chen, Sizhe ;

Wang, Haipeng ;

Xu, Feng ;

Jin, Ya-Qiu .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (08) :4806-4817

[5] CNN-Based Target Recognition and Identification for Infrared Imaging in Defense Systems [J].

d'Acremont, Antoine ;

Fablet, Ronan ;

Baussard, Alexandre ;

Quin, Guillaume .

SENSORS, 2019, 19 (09)

[6] Infrared Small-Target Detection Using Multiscale Gray Difference Weighted Image Entropy [J].

Deng, He ;

Sun, Xianping ;

Liu, Maili ;

Ye, Chaohui ;

Zhou, Xin .

IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2016, 52 (01) :60-72

[7] Convolutional Neural Network With Data Augmentation for SAR Target Recognition [J].

Ding, Jun ;

Chen, Bo ;

Liu, Hongwei ;

Huang, Mengyuan .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (03) :364-368

[8] Factorized discriminative conditional variational auto-encoder for radar HRRP target recognition [J].

Du, Chuan ;

Chen, Bo ;

Xu, Bin ;

Guo, Dandan ;

Liu, Hongwei .

SIGNAL PROCESSING, 2019, 158 :176-189

[9] Unsupervised Anomaly Detection With LSTM Neural Networks [J].

Ergen, Tolga ;

Kozat, Suleyman Serdar .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) :3127-3141

[10]

Felder R.M., 2009, ASQ High. Educ. Brief, V2, P1, DOI DOI 10.4324/9780203891414-15

← 1 2 3 4 5 →