共 50 条
[41]
Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes
[J].
IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011),
2011,
:1422-1429
[42]
Topological Value Iteration Algorithm for Markov Decision Processes
[J].
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE,
2007,
:1860-1865
[44]
New prioritized value iteration for Markov decision processes
[J].
Artificial Intelligence Review,
2012, 37
:157-167
[45]
Approximate Policy Iteration for Markov Control Revisited
[J].
COMPLEX ADAPTIVE SYSTEMS 2012,
2012, 12
:90-95
[48]
Mean Field Approximation of the Policy Iteration Algorithm for Graph-based Markov Decision Processes
[J].
ECAI 2006, PROCEEDINGS,
2006, 141
:595-+
[49]
Average-Reward Decentralized Markov Decision Processes
[J].
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE,
2007,
:1997-2002
[50]
Solving transition independent decentralized Markov decision processes
[J].
Journal of Artificial Intelligence Research,
1600, 22
:423-455