Certified policy synthesis for general Markov decision processes: An application in building automation systems

被引:11
作者
Haesaert, Sofie [1 ]
Cauchi, Nathalie [2 ]
Abate, Alessandro [2 ]
机构
[1] Tech Univ Eindhoven, Dept Elect Engn, Eindhoven, Netherlands
[2] Univ Oxford, Dept Comp Sci, Wolfson Bldg,Parks Rd, Oxford, England
关键词
Verification; Synthesis; General Markov decision processes; Safety; Building automation systems; Temperature control; MODEL-PREDICTIVE CONTROL; PROBABILITY-MEASURES; ENERGY MANAGEMENT; REDUCTION; EXISTENCE;
D O I
10.1016/j.peva.2017.09.005
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present an industrial application of new approximate similarity relations for Markov models, and show that they are key for the synthesis of control strategies. Typically, modern engineering systems are modelled using complex and high-order models which make the correct-by-design controller construction computationally hard. Using the new approximate similarity relations, this complexity is reduced and we provide certificates on the performance of the synthesised policies. The application deals with stochastic models for the thermal dynamics in a "smart building" setup: such building automation system set-up can be described by discrete-time Markov decision processes evolving over an uncountable state space and endowed with an output quantifying the room temperature. The new similarity relations draw a quantitative connection between different levels of model abstraction, and allow to quantitatively refine over complex models control strategies synthesised on simpler ones. The new relations, underpinned by the use of metrics, allow in particular for a useful trade-off between deviations over probability distributions on states and distances between model outputs. We develop a software toolbox supporting the application and the computational implementation of these new relations. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:75 / 103
页数:29
相关论文
共 6 条
[1]   VERIFICATION OF GENERAL MARKOV DECISION PROCESSES BY APPROXIMATE SIMILARITY RELATIONS AND POLICY REFINEMENT [J].
Haesaert, Sofie ;
Soudjani, Sadegh Esmaeil Zadeh ;
Abate, Alessandro .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2017, 55 (04) :2333-2367
[2]   Temporal logic control of general Markov decision processes by approximate policy refinement [J].
Haesaert, Sofie ;
Soudjani, Sadegh ;
Abate, Alessandro .
IFAC PAPERSONLINE, 2018, 51 (16) :73-78
[3]   Synthesis for PCTL in Parametric Markov Decision Processes [J].
Hahn, Ernst Moritz ;
Han, Tingting ;
Zhang, Lijun .
NASA FORMAL METHODS, 2011, 6617 :146-+
[4]   1-2-3-Go! Policy Synthesis for Parameterized Markov Decision Processes via Decision-Tree Learning and Generalization [J].
Azeem, Muqsit ;
Chakraborty, Debraj ;
Kanav, Sudeep ;
Kretinsky, Jan ;
Mohagheghi, Mohammadsadegh ;
Mohr, Stefanie ;
Weininger, Maximilian .
VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION, VMCAI 2025, PT II, 2025, 15530 :97-120
[5]   Multi-objective Robust Strategy Synthesis for Interval Markov Decision Processes [J].
Hahn, Ernst Moritz ;
Hashemi, Vahid ;
Hermanns, Holger ;
Lahijanian, Morteza ;
Turrini, Andrea .
QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2017), 2017, 10503 :207-223
[6]   In-situ backup virtual sensor application in building automation systems toward virtual sensing-enabled digital twins [J].
Choi, Youngwoong ;
Yoon, Sungmin .
CASE STUDIES IN THERMAL ENGINEERING, 2025, 66