A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame

被引：5

作者：

Chen, Li ^{[1
,2
]}

Zhang, Yulong ^{[2
,3
]}

Feng, Yanghe ^{[2
]}

Zhang, Longfei ^{[2
]}

Liu, Zhong ^{[2
]}

机构：

[1] Army Logist Acad, Chongqing 400000, Peoples R China

[2] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China

[3] 31002unit, Beijing 100000, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024年 / 35卷 / 07期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Target recognition; Man-machine systems; Data models; Costs; Radar imaging; Predictive models; Active reinforcement learning; human experience guidance; human-machine agent; machine data learning; target classification; SYSTEMS; NETWORK;

D O I：

10.1109/TNNLS.2023.3236944

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).

引用

页码：9858 / 9870

页数：13

共 43 条

[11] Radar HRRP target recognition with deep networks [J].

Feng, Bo ;

Chen, Bo ;

Liu, Hongwei .

PATTERN RECOGNITION, 2017, 61 :379-393

[12] Target classification in infrared imagery by cross-spectral synthesis using GAN [J].

Ferdous, Syeda Nyma ;

Mostofa, Moktari ;

Osahor, Uche ;

Nasrabadi, Nasser M. .

AUTOMATIC TARGET RECOGNITION XXX, 2020, 11394

[13] Selective sampling using the query by committee algorithm [J].

Freund, Y ;

Seung, HS ;

Shamir, E ;

Tishby, N .

MACHINE LEARNING, 1997, 28 (2-3) :133-168

[14] A Novel Multi-Input Bidirectional LSTM and HMM Based Approach for Target Recognition from Multi-Domain Radar Range Profiles [J].

Gao, Fei ;

Huang, Teng ;

Wang, Jun ;

Sun, Jinping ;

Hussain, Amir ;

Zhou, Huiyu .

ELECTRONICS, 2019, 8 (05)

[15] A New Algorithm for SAR Image Target Recognition Based on an Improved Deep Convolutional Neural Network [J].

Gao, Fei ;

Huang, Teng ;

Sun, Jinping ;

Wang, Jun ;

Hussain, Amir ;

Yang, Erfu .

COGNITIVE COMPUTATION, 2019, 11 (06) :809-824

[16] Visual Saliency Modeling for River Detection in High-Resolution SAR Imagery [J].

Gao, Fei ;

Ma, Fei ;

Wang, Jun ;

Sun, Jinping ;

Yang, Erfu ;

Zhou, Huiyu .

IEEE ACCESS, 2018, 6 :1000-1014

[17]

Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]

[18] LSTM: A Search Space Odyssey [J].

Greff, Klaus ;

Srivastava, Rupesh K. ;

Koutnik, Jan ;

Steunebrink, Bas R. ;

Schmidhuber, Juergen .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (10) :2222-2232

[19] Variational Temporal Deep Generative Model for Radar HRRP Target Recognition [J].

Guo, Dandan ;

Chen, Bo ;

Chen, Wenchao ;

Wang, Chaojie ;

Liu, Hongwei ;

Zhou, Mingyuan .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 :5795-5809

[20] Active Learning With Multiple Kernels [J].

Hong, Songnam ;

Chae, Jeongmin .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) :2980-2994

← 1 2 3 4 5 →