Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning

被引:41
作者
Akazaki, Takumi [1 ,2 ]
Liu, Shuang [3 ]
Yamagata, Yoriyuki [4 ]
Duan, Yihai [3 ]
Hao, Jianye [3 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Japan Soc Promot Sci, Tokyo, Japan
[3] Tianjin Univ, Sch Software, Tianjin, Peoples R China
[4] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
来源
FORMAL METHODS | 2018年 / 10951卷
关键词
MARKOV DECISION-PROCESSES;
D O I
10.1007/978-3-319-95582-7_27
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the rapid development of software and distributed computing, Cyber-Physical Systems (CPS) are widely adopted in many application areas, e.g., smart grid, autonomous automobile. It is difficult to detect defects in CPS models due to the complexities involved in the software and physical systems. To find defects in CPS models efficiently, robustness guided falsification of CPS is introduced. Existing methods use several optimization techniques to generate counterexamples, which falsify the given properties of a CPS. However those methods may require a large number of simulation runs to find the counterexample and are far from practical. In this work, we explore state-of-the-art Deep Reinforcement Learning (DRL) techniques to reduce the number of simulation runs required to find such counterexamples. We report our method and the preliminary evaluation results.
引用
收藏
页码:456 / 465
页数:10
相关论文
共 28 条
[1]   Probabilistic Temporal Logic Falsification of Cyber-Physical Systems [J].
Abbas, Houssam ;
Fainekos, Georgios ;
Sankaranarayanan, Sriram ;
Ivancic, Franjo ;
Gupta, Aarti .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 12
[2]  
Abbas H, 2012, ANN ALLERTON CONF, P1594, DOI 10.1109/Allerton.2012.6483411
[3]   Falsification of Conditional Safety Properties for Cyber-Physical Systems with Gaussian Process Regression [J].
Akazaki, Takumi .
RUNTIME VERIFICATION, (RV 2016), 2016, 10012 :439-446
[4]  
Annapureddy Y, 2011, LECT NOTES COMPUT SC, V6605, P254, DOI 10.1007/978-3-642-19835-9_21
[5]  
[Anonymous], 2017, CORR
[6]  
[Anonymous], 2016, ASYNCHRONOUS METHODS
[7]  
[Anonymous], 2016, OPENAI GYM
[8]  
Bartocci Ezio, 2018, Lectures on Runtime. Verification Introductory and Advanced Topics. LNCS 10457, P135, DOI 10.1007/978-3-319-75632-5_5
[9]   System design of stochastic models using robustness of temporal properties [J].
Bartocci, Ezio ;
Bortolussi, Luca ;
Nenzi, Laura ;
Sanguinetti, Guido .
THEORETICAL COMPUTER SCIENCE, 2015, 587 :3-25
[10]   On the Robustness of Temporal Properties for Stochastic Models [J].
Bartocci, Ezio ;
Bortolussi, Luca ;
Nenzi, Laura ;
Sanguinetti, Guido .
ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2013, (125) :3-19