AN INFORMATION BASED APPROACH TO STOCHASTIC CONTROL PROBLEMS

被引:9
作者
Bania, Piotr [1 ]
机构
[1] AGH Univ Sci & Technol, Fac Automat Control & Robot, Al A Mickiewicza 30, PL-30059 Krakow, Poland
关键词
stochastic control; feedback; information; entropy; ENTROPY FORMULATION;
D O I
10.34768/amcs-2020-0002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An information based method for solving stochastic control problems with partial observation is proposed. First, information-theoretic lower bounds of the cost function are analysed. It is shown, under rather weak assumptions, that reduction in the expected cost with closed-loop control compared with the best open-loop strategy is upper bounded by a non-decreasing function of mutual information between control variables and the state trajectory. On the basis of this result, an information based control (IBC) method is developed. The main idea of IBC consists in replacing the original control task by a sequence of control problems that are relatively easy to solve and such that information about the system state is actively generated. Two examples of the IBC operation are given. It is shown that the method is able to find an optimal solution without using dynamic programming at least in these examples. Hence the computational complexity of IBC is substantially smaller than that of dynamic programming, which is the main advantage of the proposed method.
引用
收藏
页码:23 / 34
页数:12
相关论文
共 40 条
[1]   An Information-Based Learning Approach to Dual Control [J].
Alpcan, Tansu ;
Shames, Iman .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (11) :2736-2748
[2]   NONLINEAR BAYESIAN ESTIMATION USING GAUSSIAN SUM APPROXIMATIONS [J].
ALSPACH, DL ;
SORENSON, HW .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1972, AC17 (04) :439-&
[3]  
[Anonymous], 2006, Elements of information theory
[4]  
[Anonymous], BAYESIAN FILTERING S
[5]  
[Anonymous], THESIS
[6]  
[Anonymous], THESIS
[7]  
[Anonymous], P 30 INT C INT C MAC
[8]  
[Anonymous], 2017, Entropy
[9]  
[Anonymous], 1996, Chance and Decision. Stochastic Control in Discrete Time
[10]  
Astrom KJ, 2013, ADAPTIVE CONTROL