A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame

被引:4
|
作者
Chen, Li [1 ,2 ]
Zhang, Yulong [2 ,3 ]
Feng, Yanghe [2 ]
Zhang, Longfei [2 ]
Liu, Zhong [2 ]
机构
[1] Army Logist Acad, Chongqing 400000, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Peoples R China
[3] 31002unit, Beijing 100000, Peoples R China
关键词
Task analysis; Target recognition; Man-machine systems; Data models; Costs; Radar imaging; Predictive models; Active reinforcement learning; human experience guidance; human-machine agent; machine data learning; target classification; SYSTEMS; NETWORK;
D O I
10.1109/TNNLS.2023.3236944
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement learning (TCARL_H-M), inferring when to introduce human experience guidance for model and how to autonomously classify detected targets into predefined categories with equipment information. To simulate different levels of human guidance, we set up two modes for the model: the easier-to-obtain but low-value-type cues simulated by Mode 1 and the labor-intensive but high-value class labels simulated by Mode 2. In addition, to analyze the respective roles of human experience guidance and machine data learning in target classification tasks, the article proposes a machine-based learner (TCARL_M) with zero human participation and a human-based interventionist with full human guidance (TCARL_H). Finally, based on the simulation data from a wargame, we carried out performance evaluation and application analysis for the proposed models in terms of target prediction and target classification, respectively, and the obtained results demonstrate that TCARL_H-M can not only greatly save labor costs, but achieve more competitive classification accuracy compared with our TCARL_M, TCARL_H, a purely supervised model-long short-term memory network (LSTM), a classic active learning algorithm-Query By Committee (QBC), and the common active learning model-uncertainty sampling (Uncertainty).
引用
收藏
页码:9858 / 9870
页数:13
相关论文
共 50 条
  • [1] Human-Machine Coadaptation Based on Reinforcement Learning with Policy Gradients
    Tahboub, Karim A.
    2019 8TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC'19), 2019, : 247 - 251
  • [2] Autonomous Boundary of Human-Machine Collaboration System Based on Reinforcement Learning
    Zhang, Qianqian
    Zhao, Yun-Bo
    Kang, Yu
    2020 AUSTRALIAN AND NEW ZEALAND CONTROL CONFERENCE (ANZCC 2020), 2020, : 160 - 165
  • [3] Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning
    Gao, Jensen
    Reddy, Siddharth
    Berseth, Glen
    Dragan, Anca D.
    Levine, Sergey
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7523 - 7530
  • [4] A Human-Machine Reinforcement Learning Method for Cooperative Energy Management
    Tao, Yuechuan
    Qiu, Jing
    Lai, Shuying
    Zhang, Xian
    Wang, Yunqi
    Wang, Guibin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (05) : 2974 - 2985
  • [5] Research on reinforcement learning based on PPO algorithm for human-machine intervention in autonomous driving
    Shi, Gaosong
    Zhao, Qinghai
    Wang, Jirong
    Dong, Xin
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (04): : 2424 - 2446
  • [6] A jaw based human-machine interface with machine learning
    Busch, Tobias
    Zeilfelder, Jennifer
    Zhou, Kai
    Stork, Wilhelm
    2019 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2019,
  • [7] Human-Machine Interactive Learning Method Based on Active Learning for Smart Workshop Dynamic Scheduling
    Wang, Dongyuan
    Guan, Liuen
    Liu, Juan
    Ding, Chen
    Qiao, Fei
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2023, 53 (06) : 1038 - 1047
  • [8] Human-Machine Collaborative Reinforcement Learning for Power Line Flow Regulation
    Wang, Chenxi
    Du, Youtian
    Chang, Yuanlin
    Guo, Zihao
    Huang, Yanhao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 5087 - 5099
  • [9] Shapley-Optimized Reinforcement Learning for Human-Machine Collaboration Policy
    Zhang, Jie
    Niu, Yiqun
    He, Wei
    Jin, Cheng
    Wang, Chongjun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 2, 2025, 14851 : 291 - 300
  • [10] Multi-agent modelling and analysis of the knowledge learning of a human-machine hybrid intelligent organization with human-machine trust
    Xue, Chaogai
    Zhang, Haoxiang
    Cao, Haiwang
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)