Efficient Safe Control via Deep Reinforcement Learning and Supervisory Control - Case Study on Multi-Robot

被引:4
作者
Konishi, Masahiro [1 ]
Sasaki, Tomotake [2 ]
Cai, Kai [1 ]
机构
[1] Osaka Metropolitan Univ, Osaka, Japan
[2] Fujitsu Ltd, Kawasaki, Kanagawa, Japan
关键词
Safe Control; Supervisory Control; Deep Reinforcement Learning;
D O I
10.1016/j.ifacol.2022.10.318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Safe control has recently attracted much attention due to its applications in safetycritical cyber-physical systems. Supervisory control theory (SCT) is a formal control method that provides correct-by-construction safety certificates, but is computationally inefficient when the number of system components is large. On the other hand, deep reinforcement learning (DRL) provides a toolbox of efficient algorithms to compute control decisions even for very large state space, but does not always guarantee safety. In this paper, we propose to synergize SCT and DRL into a new efficient safe control approach. Specifically, we first employ DRL algorithms to efficiently compute sub-optimal solutions which may be unsafe; then we convert the obtained solutions into a standard supervisory control problem with an automaton (plant model) and a set of unsafe states (safety specification); finally we use SCT to synthesize a supervisor with a safety certificate. A case study of multi-robot warehouse logistic automation is conducted to demonstrate the efficiency of this proposed approach. Copyright (C) 2022 The Authors.
引用
收藏
页码:16 / 21
页数:6
相关论文
共 16 条
[1]  
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[2]  
Belta C, 2017, STUD SYST DECIS CONT, V89, P1, DOI 10.1007/978-3-319-50763-7
[3]  
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[4]  
Cai K., 2021, ENCY SYSTEMS CONTROL, V2, P2245
[5]   Warehouse automation by logistic robotic networks: a cyber-physical control approach [J].
Cai, Kai .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (05) :693-704
[6]  
Kasahara M., 2021, PROC 29 MEDITERRANEA
[7]  
Kasahara M., 2021, PROC 17 INT C AUTOMA
[8]  
Konishi M, 2021, PROC 64 JAPAN JOINT, P388
[9]  
Liang E., 2018, PROC 35 INT C MACHIN
[10]  
Mnih V, 2016, PR MACH LEARN RES, V48