On the adaptive control of a class of partially observed Markov decision processes

被引:0
作者
Hsu, Shun-Pin [1 ]
Arapostathis, Ari [2 ]
机构
[1] Natl Chung Hsing Univ, Dept Elect Engn, Taichung 402, Taiwan
[2] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
Adaptive control; Markov decision processes; Average-cost optimality; AVERAGE COST CRITERION; CHAINS;
D O I
10.1016/j.jmaa.2011.03.004
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
This paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function. Provided that a sequence of parameter estimates converging in probability to the true parameter value is available, we show that the certainty equivalence adaptive policy is optimal in the long-run average sense. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 13 条