Efficient Safe Control via Deep Reinforcement Learning and Supervisory Control - Case Study on Multi-Robot

被引:2
|
作者
Konishi, Masahiro [1 ]
Sasaki, Tomotake [2 ]
Cai, Kai [1 ]
机构
[1] Osaka Metropolitan Univ, Osaka, Japan
[2] Fujitsu Ltd, Kawasaki, Kanagawa, Japan
来源
IFAC PAPERSONLINE | 2022年 / 55卷 / 28期
关键词
Safe Control; Supervisory Control; Deep Reinforcement Learning;
D O I
10.1016/j.ifacol.2022.10.318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Safe control has recently attracted much attention due to its applications in safetycritical cyber-physical systems. Supervisory control theory (SCT) is a formal control method that provides correct-by-construction safety certificates, but is computationally inefficient when the number of system components is large. On the other hand, deep reinforcement learning (DRL) provides a toolbox of efficient algorithms to compute control decisions even for very large state space, but does not always guarantee safety. In this paper, we propose to synergize SCT and DRL into a new efficient safe control approach. Specifically, we first employ DRL algorithms to efficiently compute sub-optimal solutions which may be unsafe; then we convert the obtained solutions into a standard supervisory control problem with an automaton (plant model) and a set of unsafe states (safety specification); finally we use SCT to synthesize a supervisor with a safety certificate. A case study of multi-robot warehouse logistic automation is conducted to demonstrate the efficiency of this proposed approach. Copyright (C) 2022 The Authors.
引用
收藏
页码:16 / 21
页数:6
相关论文
共 50 条
  • [1] Multi-Robot Flocking Control Based on Deep Reinforcement Learning
    Zhu, Pengming
    Dai, Wei
    Yao, Weijia
    Ma, Junchong
    Zeng, Zhiwen
    Lu, Huimin
    IEEE ACCESS, 2020, 8 : 150397 - 150406
  • [2] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Junchong Ma
    Huimin Lu
    Junhao Xiao
    Zhiwen Zeng
    Zhiqiang Zheng
    Journal of Intelligent & Robotic Systems, 2020, 99 : 371 - 386
  • [3] Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning
    Ma, Junchong
    Lu, Huimin
    Xiao, Junhao
    Zeng, Zhiwen
    Zheng, Zhiqiang
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (02) : 371 - 386
  • [4] Obtaining Robust Control and Navigation Policies for Multi-robot Navigation via Deep Reinforcement Learning
    Jestel, Christian
    Surmann, Harmtmut
    Stenzel, Jonas
    Urbann, Oliver
    Brehler, Marius
    2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 48 - 54
  • [5] Connectivity Guaranteed Multi-robot Navigation via Deep Reinforcement Learning
    Lin, Juntong
    Yang, Xuyun
    Zheng, Peiwei
    Cheng, Hui
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
  • [6] Cooperative exploration based on supervisory control of multi-robot systems
    Dai, Xuefeng
    Jiang, Laihao
    Zhao, Yan
    APPLIED INTELLIGENCE, 2016, 45 (01) : 18 - 29
  • [7] Cooperative exploration based on supervisory control of multi-robot systems
    Xuefeng Dai
    Laihao Jiang
    Yan Zhao
    Applied Intelligence, 2016, 45 : 18 - 29
  • [8] Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey
    Orr, James
    Dutta, Ayan
    SENSORS, 2023, 23 (07)
  • [9] Hierarchical Multi-Robot Pursuit with Deep Reinforcement Learning and Navigation Planning
    Chen, Wenzhang
    Zhu, Yuanheng
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 1274 - 1280
  • [10] Hierarchical multi-robot navigation and formation in unknown environments via deep reinforcement learning and distributed optimization
    Chang, Lu
    Shan, Liang
    Zhang, Weilong
    Dai, Yuewei
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2023, 83