A Dynamic Programming Algorithm for Decentralized Markov Decision Processes with a Broadcast Structure

被引:13
作者
Wu, Jeff [1 ]
Lall, Sanjay [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
来源
49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2010年
关键词
COMPLEXITY;
D O I
10.1109/CDC.2010.5718187
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast information structure that consists of a central node that only has access to its own state but can affect several outer nodes, while each outer node has access to both its own state and the central node's state, but cannot affect the other nodes. The solution to this problem involves a dynamic program similar to that of a centralized partially-observed Markov decision process.
引用
收藏
页码:6143 / 6148
页数:6
相关论文
共 10 条
[1]   Solving transition independent decentralized Markov decision processes [J].
Becker, R ;
Zilberstein, S ;
Lesser, V ;
Goldman, CV .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2004, 22 :423-455
[2]   The complexity of decentralized control of Markov decision processes [J].
Bernstein, DS ;
Givan, R ;
Immerman, N ;
Zilberstein, S .
MATHEMATICS OF OPERATIONS RESEARCH, 2002, 27 (04) :819-840
[3]  
Cassandra A. R., 2008, P 13 C UNC ART INT
[4]  
HSU K, 1982, IEEE T AUTOMAT CONTR, V27, P426, DOI 10.1109/TAC.1982.1102924
[5]   Identifying tractable decentralized control problems on the basis of information structure [J].
Mahajan, Aditya ;
Nayyar, Ashutosh ;
Teneketzis, Demosthenis .
2008 46TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1-3, 2008, :1440-1449
[6]   THE COMPLEXITY OF MARKOV DECISION-PROCESSES [J].
PAPADIMITRIOU, CH ;
TSITSIKLIS, JN .
MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (03) :441-450
[7]   INTRACTABLE PROBLEMS IN CONTROL-THEORY [J].
PAPADIMITRIOU, CH ;
TSITSIKLIS, J .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1986, 24 (04) :639-654
[8]   OPTIMAL CONTROL OF PARTIALLY OBSERVABLE MARKOV PROCESSES OVER A FINITE HORIZON [J].
SMALLWOOD, RD ;
SONDIK, EJ .
OPERATIONS RESEARCH, 1973, 21 (05) :1071-1088
[9]  
Swigart J., 2010, AM CONTR C UNPUB
[10]   A COUNTEREXAMPLE IN STOCHASTIC OPTIMUM CONTROL [J].
WITSENHAUSEN, HS .
SIAM JOURNAL ON CONTROL, 1968, 6 (01) :131-+