Reachability Verification Based Reliability Assessment for Deep Reinforcement Learning Controlled Robotics and Autonomous Systems

被引:2
作者
Dong, Yi [1 ]
Zhao, Xingyu [2 ]
Wang, Sen [3 ]
Huang, Xiaowei [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, Lancs, England
[2] Univ Warwick, WMG, Coventry CV4 7AL, Warwick, England
[3] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
关键词
Safety; Power system reliability; Trajectory; Reachability analysis; Software reliability; Robots; Reinforcement learning; Robot safety; formal methods in robotics and automation; AI-enabled robotics;
D O I
10.1109/LRA.2024.3364471
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep Reinforcement Learning (DRL) has achieved impressive performance in robotics and autonomous systems (RAS). A key challenge to its deployment in real-life operations is the presence of spuriously unsafe DRL policies. Unexplored states may lead the agent to make wrong decisions that could result in hazards, especially in applications where DRL-trained end-to-end controllers govern the behaviour of RAS. This letter proposes a novel quantitative reliability assessment framework for DRL-controlled RAS, leveraging verification evidence generated from formal reliability analysis of neural networks. A two-level verification framework is introduced to check the safety property with respect to inaccurate observations that are due to, e.g., environmental noise and state changes. Reachability verification tools are leveraged locally to generate safety evidence of trajectories. In contrast, at the global level, we quantify the overall reliability as an aggregated metric of local safety evidence, corresponding to a set of distinct tasks and their occurrence probabilities. The effectiveness of the proposed verification framework is demonstrated and validated via experiments on real RAS.
引用
收藏
页码:3299 / 3306
页数:8
相关论文
共 29 条
  • [1] Agarwal Rishabh, 2021, Advances in Neural Information Processing Systems, V34
  • [2] Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
  • [3] Althoff M., 2010, THESIS
  • [4] Amir Guy, 2021, 2021 Formal Methods in Computer Aided Design (FMCAD), P193, DOI 10.34727/2021/isbn.978-3-85448-046-4_28
  • [5] Berkenkamp F., 2019, Safe Exploration in Reinforcement Learning: Theory And Applications in Robotics
  • [6] Dalal G, 2018, Arxiv, DOI arXiv:1801.08757
  • [7] Dong Y., 2022, ACM Trans. Embedded Comput.Syst., V22, P1
  • [8] Reachability Analysis for Neural Feedback Systems using Regressive Polynomial Rule Inference
    Dutta, Souradeep
    Chen, Xin
    Sankaranarayanan, Sriram
    [J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 157 - 168
  • [9] Verifying Learning-Augmented Systems
    Eliyahu, Tomer
    Kazak, Yafim
    Katz, Guy
    Schapira, Michael
    [J]. SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, : 305 - 318
  • [10] ReachNN*: A Tool for Reachability Analysis of Neural-Network Controlled Systems
    Fan, Jiameng
    Huang, Chao
    Chen, Xin
    Li, Wenchao
    Zhu, Qi
    [J]. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2020), 2020, 12302 : 537 - 542