Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems

被引：2

作者：

Dong, Yi ^{[1
]}

Zhao, Xingyu ^{[2
]}

Wang, Sen ^{[3
]}

Huang, Xiaowei ^{[1
]}

机构：

[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, Lancs, England

[2] Univ Warwick, WMG, Coventry CV4 7AL, Warwick, England

[3] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 04期

关键词：

Safety; Power system reliability; Trajectory; Reachability analysis; Software reliability; Robots; Reinforcement learning; Robot safety; formal methods in robotics and automation; AI-enabled robotics;

D O I：

10.1109/LRA.2024.3364471

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Deep Reinforcement Learning (DRL) has achieved impressive performance in robotics and autonomous systems (RAS). A key challenge to its deployment in real-life operations is the presence of spuriously unsafe DRL policies. Unexplored states may lead the agent to make wrong decisions that could result in hazards, especially in applications where DRL-trained end-to-end controllers govern the behaviour of RAS. This letter proposes a novel quantitative reliability assessment framework for DRL-controlled RAS, leveraging verification evidence generated from formal reliability analysis of neural networks. A two-level verification framework is introduced to check the safety property with respect to inaccurate observations that are due to, e.g., environmental noise and state changes. Reachability verification tools are leveraged locally to generate safety evidence of trajectories. In contrast, at the global level, we quantify the overall reliability as an aggregated metric of local safety evidence, corresponding to a set of distinct tasks and their occurrence probabilities. The effectiveness of the proposed verification framework is demonstrated and validated via experiments on real RAS.

引用

页码：3299 / 3306

页数：8

共 29 条

[1] Agarwal Rishabh, 2021, Advances in Neural Information Processing Systems, V34
[2] Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
[3] Althoff M., 2010, THESIS
[4] Amir Guy, 2021, 2021 Formal Methods in Computer Aided Design (FMCAD), P193, DOI 10.34727/2021/isbn.978-3-85448-046-4_28
[5] Berkenkamp F., 2019, Safe Exploration in Reinforcement Learning: Theory And Applications in Robotics
[6] Dalal G, 2018, Arxiv, DOI arXiv:1801.08757
[7] Dong Y., 2022, ACM Trans. Embedded Comput.Syst., V22, P1
[8] Reachability Analysis for Neural Feedback Systems using Regressive Polynomial Rule Inference
Dutta, Souradeep
Chen, Xin
Sankaranarayanan, Sriram
[J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 157 - 168
[9] Verifying Learning-Augmented Systems
Eliyahu, Tomer
Kazak, Yafim
Katz, Guy
Schapira, Michael
[J]. SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, : 305 - 318
[10] ReachNN*: A Tool for Reachability Analysis of Neural-Network Controlled Systems
Fan, Jiameng
Huang, Chao
Chen, Xin
Li, Wenchao
Zhu, Qi
[J]. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2020), 2020, 12302 : 537 - 542

← 1 2 3 →