A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame

被引:4
|
作者
Chen, Li [1 ,2 ]
Zhang, Yulong [2 ,3 ]
Feng, Yanghe [2 ]
Zhang, Longfei [2 ]
Liu, Zhong [2 ]
机构
[1] Army Logist Acad, Chongqing 400000, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[3] 31002unit, Beijing 100000, Peoples R China
关键词
Task analysis; Target recognition; Man-machine systems; Data models; Costs; Radar imaging; Predictive models; Active reinforcement learning; human experience guidance; human-machine agent; machine data learning; target classification; SYSTEMS; NETWORK;
D O I
10.1109/TNNLS.2023.3236944
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).
引用
收藏
页码:9858 / 9870
页数:13
相关论文
共 50 条
  • [41] A learning-by-metaphor human-machine system
    Rubin, Stuart H.
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 4439 - 4444
  • [42] Modes in human-machine systems: Constructs, representation, and classification
    Degani, A
    Shafto, M
    Kirlik, A
    INTERNATIONAL JOURNAL OF AVIATION PSYCHOLOGY, 1999, 9 (02): : 125 - 138
  • [43] Deep Reinforcement Learning for Active Target Tracking
    Jeong, Heejin
    Hassani, Hamed
    Morari, Manfred
    Lee, Daniel D.
    Pappas, George J.
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1825 - 1831
  • [44] Reinforcement Learning Based Online Active Learning for Human Activity Recognition
    Cui, Yulai
    Hiremath, Shruthi K.
    Plotz, Thomas
    PROCEEDINGS OF THE 2022 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, ISWC 2022, 2022, : 23 - 27
  • [45] Machine Learning-Supported Designing of Human-Machine Interfaces
    Bantay, Laszlo
    Abonyi, Janos
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [46] Machine learning and human-machine trust in healthcare: A systematic survey
    Lin, Han
    Han, Jiatong
    Wu, Pingping
    Wang, Jiangyan
    Tu, Juan
    Tang, Hao
    Zhu, Liuning
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (02) : 286 - 302
  • [47] Machine Learning for Human-Machine Systems With Advanced Persistent Threats
    Chen, Long
    Zhang, Wei
    Song, Yanqing
    Chen, Jianguo
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2024, 54 (06) : 753 - 761
  • [48] Adaptive Human-Machine Evaluation Framework Using Stochastic Gradient Descent-Based Reinforcement Learning for Dynamic Competing Network
    Kim, Jinbae
    Lee, Hyunsoo
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [49] Human-machine Interaction Based on Voice
    Lu, Jia-ni
    Qian, Hua
    Xiao, Ai-ping
    Shi, Miao-wen
    CONFERENCE ON MODELING, IDENTIFICATION AND CONTROL, 2012, 3 : 583 - 588
  • [50] Variable Impedance-based Human-machine Interaction Method Using Reinforcement Learning for Shared Steering Control of Intelligent Vehicle
    Han J.
    Zhao J.
    Zhu B.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2022, 58 (18): : 141 - 149