Efficient Safe Control via Deep Reinforcement Learning and Supervisory Control - Case Study on Multi-Robot

被引:2
|
作者
Konishi, Masahiro [1 ]
Sasaki, Tomotake [2 ]
Cai, Kai [1 ]
机构
[1] Osaka Metropolitan Univ, Osaka, Japan
[2] Fujitsu Ltd, Kawasaki, Kanagawa, Japan
来源
IFAC PAPERSONLINE | 2022年 / 55卷 / 28期
关键词
Safe Control; Supervisory Control; Deep Reinforcement Learning;
D O I
10.1016/j.ifacol.2022.10.318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Safe control has recently attracted much attention due to its applications in safetycritical cyber-physical systems. Supervisory control theory (SCT) is a formal control method that provides correct-by-construction safety certificates, but is computationally inefficient when the number of system components is large. On the other hand, deep reinforcement learning (DRL) provides a toolbox of efficient algorithms to compute control decisions even for very large state space, but does not always guarantee safety. In this paper, we propose to synergize SCT and DRL into a new efficient safe control approach. Specifically, we first employ DRL algorithms to efficiently compute sub-optimal solutions which may be unsafe; then we convert the obtained solutions into a standard supervisory control problem with an automaton (plant model) and a set of unsafe states (safety specification); finally we use SCT to synthesize a supervisor with a safety certificate. A case study of multi-robot warehouse logistic automation is conducted to demonstrate the efficiency of this proposed approach. Copyright (C) 2022 The Authors.
引用
收藏
页码:16 / 21
页数:6
相关论文
共 50 条
  • [1] Safe multi-agent reinforcement learning for multi-robot control
    Gu, Shangding
    Kuba, Jakub Grudzien
    Chen, Yuanpei
    Du, Yali
    Yang, Long
    Knoll, Alois
    Yang, Yaodong
    ARTIFICIAL INTELLIGENCE, 2023, 319
  • [2] Multi-Robot Flocking Control Based on Deep Reinforcement Learning
    Zhu, Pengming
    Dai, Wei
    Yao, Weijia
    Ma, Junchong
    Zeng, Zhiwen
    Lu, Huimin
    IEEE ACCESS, 2020, 8 : 150397 - 150406
  • [3] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Ma, Junchong
    Lu, Huimin
    Xiao, Junhao
    Zeng, Zhiwen
    Zheng, Zhiqiang
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (02) : 371 - 386
  • [4] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Junchong Ma
    Huimin Lu
    Junhao Xiao
    Zhiwen Zeng
    Zhiqiang Zheng
    Journal of Intelligent & Robotic Systems, 2020, 99 : 371 - 386
  • [5] Obtaining Robust Control and Navigation Policies for Multi-robot Navigation via Deep Reinforcement Learning
    Jestel, Christian
    Surmann, Harmtmut
    Stenzel, Jonas
    Urbann, Oliver
    Brehler, Marius
    2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 48 - 54
  • [6] Multi-robot Formation Control Using Reinforcement Learning Method
    Zuo, Guoyu
    Han, Jiatong
    Han, Guansheng
    ADVANCES IN SWARM INTELLIGENCE, PT 1, PROCEEDINGS, 2010, 6145 : 667 - 674
  • [7] Connectivity Guaranteed Multi-robot Navigation via Deep Reinforcement Learning
    Lin, Juntong
    Yang, Xuyun
    Zheng, Peiwei
    Cheng, Hui
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [8] Decentralized Control of Multi-Robot System in Cooperative Object Transportation Using Deep Reinforcement Learning
    Zhang, Lin
    Sun, Yufeng
    Barth, Andrew
    Ma, Ou
    IEEE ACCESS, 2020, 8 : 184109 - 184119
  • [9] Control Fusion for Safe Multi-Robot Coordination
    Bostelman, Roger
    Marvel, Jeremy
    MULTISENSOR, MULTISOURCE INFORMATION FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS 2014, 2014, 9121
  • [10] Distributed safe reinforcement learning for multi-robot motion planning
    Lu, Yang
    Guo, Yaohua
    Zhao, Guoxiang
    Zhu, Minghui
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 1209 - 1214