A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame

被引:4
|
作者
Chen, Li [1 ,2 ]
Zhang, Yulong [2 ,3 ]
Feng, Yanghe [2 ]
Zhang, Longfei [2 ]
Liu, Zhong [2 ]
机构
[1] Army Logist Acad, Chongqing 400000, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[3] 31002unit, Beijing 100000, Peoples R China
关键词
Task analysis; Target recognition; Man-machine systems; Data models; Costs; Radar imaging; Predictive models; Active reinforcement learning; human experience guidance; human-machine agent; machine data learning; target classification; SYSTEMS; NETWORK;
D O I
10.1109/TNNLS.2023.3236944
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).
引用
收藏
页码:9858 / 9870
页数:13
相关论文
共 50 条
  • [31] Active learning-based hyperspectral image classification: a reinforcement learning approach
    Patel, Usha
    Patel, Vibha
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (02): : 2461 - 2486
  • [32] Learning agent based on reinforcement learning
    Li, N.
    Gao, Y.
    Lu, X.
    Chen, S.F.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2001, 38 (09):
  • [33] Soft Exoskeleton Glove for Hand Assistance Based on Human-machine Interaction and Machine Learning
    Chen, Xiaoshi
    Gong, Li
    Zheng, Lirong
    Zou, Zhuo
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 324 - 329
  • [34] Deep Learning for EMG-based Human-Machine Interaction: A Review
    Dezhen Xiong
    Daohui Zhang
    Xingang Zhao
    Yiwen Zhao
    IEEE/CAAJournalofAutomaticaSinica, 2021, 8 (03) : 512 - 533
  • [35] A Generic Human-Machine Annotation Framework Based on Dynamic Cooperative Learning
    Zhang, Yue
    Michi, Andrea
    Wagner, Johannes
    Andre, Elisabeth
    Schuller, Bjorn
    Weninger, Felix
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (03) : 1230 - 1239
  • [36] Multi-Agent Active Perception Based on Reinforcement Learning and POMDP
    Selimovic, Tarik
    Peti, Marijana
    Bogdan, Stjepan
    IEEE ACCESS, 2024, 12 : 48004 - 48016
  • [37] Design of Outdoor Space Based on Human-machine Interaction and Deep Learning
    Li J.
    Pan J.
    Zhou G.
    Computer-Aided Design and Applications, 2024, 21 (s7): : 88 - 103
  • [38] Deep Learning for EMG-based Human-Machine Interaction: A Review
    Xiong, Dezhen
    Zhang, Daohui
    Zhao, Xingang
    Zhao, Yiwen
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (03) : 512 - 533
  • [39] Human-Machine Interaction Issues in Quality Control Based on Online Image Classification
    Lughofer, Edwin
    Smith, James E.
    Tahir, Muhammad Atif
    Caleb-Solly, Praminda
    Eitzinger, Christian
    Sannen, Davy
    Nuttin, Marnix
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2009, 39 (05): : 960 - 971
  • [40] Human-Machine Learning for Intelligent Aircraft Systems
    Rubin, Stuart H.
    Lee, Gordon
    AUTONOMOUS AND INTELLIGENT SYSTEMS, 2011, 6752 : 331 - 342