COOL-MC: A Comprehensive Tool for Reinforcement Learning and Model Checking

被引:5
作者
Gross, Dennis [1 ]
Jansen, Nils [1 ]
Junges, Sebastian [1 ]
Perez, Guillermo A. [2 ]
机构
[1] Radboud Univ Nijmegen, Nijmegen, Netherlands
[2] Univ Antwerp, Flanders Make, Antwerp, Belgium
来源
DEPENDABLE SOFTWARE ENGINEERING. THEORIES, TOOLS, AND APPLICATIONS, SETTA | 2022年 / 13649卷
关键词
D O I
10.1007/978-3-031-21213-0_3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents COOL-MC, a tool that integrates state-of-the-art reinforcement learning (RL) and model checking. Specifically, the tool builds upon the OpenAI gym and the probabilistic model checker Storm. COOL-MC provides the following features: (1) a simulator to train RL policies in the OpenAI gym for Markov decision processes (MDPs) that are defined as input for Storm, (2) a new model builder for Storm, which uses callback functions to verify (neural network) RL policies, (3) formal abstractions that relate models and policies specified in OpenAI gym or Storm, and (4) algorithms to obtain bounds on the performance of so-called permissive policies. We describe the components and architecture of COOL-MC and demonstrate its features on multiple benchmark environments.
引用
收藏
页码:41 / 49
页数:9
相关论文
共 42 条
[1]  
Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
[2]  
[Anonymous], 2016, LECT NOTES COMPUT SC, V9636, P130, DOI [10.1007/978-3-662-49674-9, DOI 10.1007/978-3-662-49674-9]
[3]  
[Anonymous], 2016, Openai gym
[4]  
Bacci Edoardo., 2022, Verified probabilistic policies for deep reinforcement learning
[5]  
Baier C, 2008, PRINCIPLES OF MODEL CHECKING, P1
[6]  
Boron J, 2020, IEEE CONF COMPU INTE, P728, DOI 10.1109/CoG47356.2020.9231609
[7]  
Brázdil T, 2014, LECT NOTES COMPUT SC, V8837, P98, DOI 10.1007/978-3-319-11936-6_8
[8]  
Cassez F, 2005, LECT NOTES COMPUT SC, V3653, P66, DOI 10.1007/11539452_9
[9]  
Clarke E.M., 2018, Handbook of Model Checking, V10, DOI [10.1007/978-3-319-10575-8, DOI 10.1007/978-3-319-10575-8]
[10]  
David Alexandre, 2015, Tools and Algorithms for the Construction and Analysis of Systems. 21st International Conference, TACAS 2015, held as part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2015. Proceedings: LNCS 9035, P206, DOI 10.1007/978-3-662-46681-0_16