Behavioral control task supervisor with memory based on reinforcement learning for human-multi-robot coordination systems

被引:5
作者
Huang, Jie [1 ,2 ,3 ]
Mo, Zhibin [1 ,2 ,3 ]
Zhang, Zhenyi [1 ,2 ,3 ]
Chen, Yutao [1 ,2 ,3 ]
机构
[1] Fuzhou Univ, Sch Elect Engn & Automat, Fuzhou 350108, Peoples R China
[2] Fuzhou Univ, 5G Ind Internet Inst, Fuzhou 350108, Peoples R China
[3] Fuzhou Univ, Key Lab Ind Automat Control Technol & Informat Pr, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
Human-multi-robot coordination systems; Null-space-based behavioral control; Task supervisor; Reinforcement learning; Knowledge base; TP18; DECISION-MAKING; COLLABORATION; AGENTS;
D O I
10.1631/FITEE.2100280
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this study, a novel reinforcement learning task supervisor (RLTS) with memory in a behavioral control framework is proposed for human-multi-robot coordination systems (HMRCSs). Existing HMRCSs suffer from high decision-making time cost and large task tracking errors caused by repeated human intervention, which restricts the autonomy of multi-robot systems (MRSs). Moreover, existing task supervisors in the null-space-based behavioral control (NSBC) framework need to formulate many priority-switching rules manually, which makes it difficult to realize an optimal behavioral priority adjustment strategy in the case of multiple robots and multiple tasks. The proposed RLTS with memory provides a detailed integration of the deep Q-network (DQN) and long short-term memory (LSTM) knowledge base within the NSBC framework, to achieve an optimal behavioral priority adjustment strategy in the presence of task conflict and to reduce the frequency of human intervention. Specifically, the proposed RLTS with memory begins by memorizing human intervention history when the robot systems are not confident in emergencies, and then reloads the history information when encountering the same situation that has been tackled by humans previously. Simulation results demonstrate the effectiveness of the proposed RLTS. Finally, an experiment using a group of mobile robots subject to external noise and disturbances validates the effectiveness of the proposed RLTS with memory in uncertain real-world environments.
引用
收藏
页码:1174 / 1188
页数:15
相关论文
共 50 条
  • [1] Behavioral control task supervisor with memory based on reinforcement learning for human—multi-robot coordination systems面向人—多机器人协同系统的带记忆强化学习行为控制任务管理器
    Jie Huang
    Zhibin Mo
    Zhenyi Zhang
    Yutao Chen
    Frontiers of Information Technology & Electronic Engineering, 2022, 23 : 1174 - 1188
  • [2] Shared Impedance Control Based on Reinforcement Learning in a Human-Robot Collaboration Task
    Wu, Min
    He, Yanhao
    Liu, Steven
    ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, 2020, 980 : 95 - 103
  • [3] A Control Barrier Function Approach to Human-multi-robot Safe Interaction
    Lippi, Martina
    Marino, Alessandro
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 604 - 609
  • [4] Human-Multi-Robot Task Allocation in Agricultural Settings: a Mixed Integer Linear Programming Approach
    Lippi, Martina
    Gallou, Jorand
    Palmieri, Jozsef
    Gasparri, Andrea
    Marino, Alessandro
    2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 1056 - 1062
  • [5] Multi-agent reinforcement learning for redundant robot control in task-space
    Perrusquia, Adolfo
    Yu, Wen
    Li, Xiaoou
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (01) : 231 - 241
  • [6] Multi-agent reinforcement learning for redundant robot control in task-space
    Adolfo Perrusquía
    Wen Yu
    Xiaoou Li
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 231 - 241
  • [7] Research on Motion Planning Based on Flocking Control and Reinforcement Learning for Multi-Robot Systems
    Wang, Minghui
    Zeng, Bi
    Wang, Qiujie
    MACHINES, 2021, 9 (04)
  • [8] Applying Reinforcement Learning to Multi-robot Team Coordination
    Sanz, Yolanda
    de Lope, Javier
    Antonio Martin H, Jose
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2008, 5271 : 625 - +
  • [9] Cooperative Multi-Robot Task Allocation with Reinforcement Learning
    Park, Bumjin
    Kang, Cheongwoong
    Choi, Jaesik
    APPLIED SCIENCES-BASEL, 2022, 12 (01):
  • [10] Adaptive Multi-Task Human-Robot Interaction Based on Human Behavioral Intention
    Fu, Jian
    Du, Jinyu
    Teng, Xiang
    Fu, Yuxiang
    Wu, Lu
    IEEE ACCESS, 2021, 9 : 133762 - 133773