Behavioral control task supervisor with memory based on reinforcement learning for human-multi-robot coordination systems

被引:5
作者
Huang, Jie [1 ,2 ,3 ]
Mo, Zhibin [1 ,2 ,3 ]
Zhang, Zhenyi [1 ,2 ,3 ]
Chen, Yutao [1 ,2 ,3 ]
机构
[1] Fuzhou Univ, Sch Elect Engn & Automat, Fuzhou 350108, Peoples R China
[2] Fuzhou Univ, 5G Ind Internet Inst, Fuzhou 350108, Peoples R China
[3] Fuzhou Univ, Key Lab Ind Automat Control Technol & Informat Pr, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-multi-robot coordination systems; Null-space-based behavioral control; Task supervisor; Reinforcement learning; Knowledge base; TP18; DECISION-MAKING; COLLABORATION; AGENTS;
D O I
10.1631/FITEE.2100280
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, a novel reinforcement learning task supervisor (RLTS) with memory in a behavioral control framework is proposed for human-multi-robot coordination systems (HMRCSs). Existing HMRCSs suffer from high decision-making time cost and large task tracking errors caused by repeated human intervention, which restricts the autonomy of multi-robot systems (MRSs). Moreover, existing task supervisors in the null-space-based behavioral control (NSBC) framework need to formulate many priority-switching rules manually, which makes it difficult to realize an optimal behavioral priority adjustment strategy in the case of multiple robots and multiple tasks. The proposed RLTS with memory provides a detailed integration of the deep Q-network (DQN) and long short-term memory (LSTM) knowledge base within the NSBC framework, to achieve an optimal behavioral priority adjustment strategy in the presence of task conflict and to reduce the frequency of human intervention. Specifically, the proposed RLTS with memory begins by memorizing human intervention history when the robot systems are not confident in emergencies, and then reloads the history information when encountering the same situation that has been tackled by humans previously. Simulation results demonstrate the effectiveness of the proposed RLTS. Finally, an experiment using a group of mobile robots subject to external noise and disturbances validates the effectiveness of the proposed RLTS with memory in uncertain real-world environments.
引用
收藏
页码:1174 / 1188
页数:15
相关论文
共 50 条
  • [21] Style-Based Reinforcement Learning: Task Decoupling Personalization for Human-Robot Collaboration
    Bonyani, Mandi
    Soleymani, Maryam
    Wang, Chao
    UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION, PT I, UAHCI 2024, 2024, 14696 : 197 - 212
  • [22] Heterogeneous Multi-robot Task Allocation and Scheduling via Reinforcement Learning
    Dai, Weiheng
    Rai, Utkarsh
    Chiun, Jimmy
    Cao, Yuhong
    Sartoretti, Guillaume
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2654 - 2661
  • [23] A review of developments in reinforcement learning for multi-robot systems
    Ma, Lei, 1600, Science Press (49): : 1032 - 1044
  • [24] A Decision Control Method for Autonomous Driving Based on Multi-Task Reinforcement Learning
    Cai, Yingfeng
    Yang, Shaoqing
    Wang, Hai
    Teng, Chenglong
    Chen, Long
    IEEE ACCESS, 2021, 9 (09): : 154553 - 154562
  • [25] Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task
    Viejo, Guillaume
    Girard, Benoit
    Procyk, Emmanuel
    Khamassi, Mehdi
    BEHAVIOURAL BRAIN RESEARCH, 2018, 355 : 76 - 89
  • [26] Multi-agent reinforcement learning behavioral control for nonlinear second-order systems
    Zhang, Zhenyi
    Huang, Jie
    Pan, Congjie
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 869 - 886
  • [27] Coordination of adaptive working memory and reinforcement learning systems explaining choice and reaction time in a human experiment
    Guillaume D Viejo
    Mehdi Khamassi
    Andrea Brovelli
    Benoît Girard
    BMC Neuroscience, 15 (Suppl 1)
  • [28] Behavioral-Fusion Control Based on Reinforcement Learning
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Wu, Chun-Ju
    Wu, Cheng-Shong
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 401 - 406
  • [29] Human-to-Robot Handover Based on Reinforcement Learning
    Kim, Myunghyun
    Yang, Sungwoo
    Kim, Beomjoon
    Kim, Jinyeob
    Kim, Donghan
    SENSORS, 2024, 24 (19)
  • [30] Hierarchical Deep Reinforcement Learning for Computation Offloading in Autonomous Multi-Robot Systems
    Gao, Wen
    Yu, Zhiwen
    Wang, Liang
    Cui, Helei
    Guo, Bin
    Xiong, Hui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 540 - 547