32 entries in total
[1]
Abul O., Polat F., Alhajj R. (2000) "Multiagent reinforcement learning using function approximation" IEEE Trans. Syst., Man, Cybern. vol. 30 485-497
[2]
Boutilier C., Dean T., Hanks S. (1999) "Decision-theoretic planning: Structural assumptions and computational leverage" J. Artif. Intell. Res. vol. 11 1-94
[3]
Boutilier C., Dearden R., Goldszmidt M. (2000) "Stochastic dynamic programming with factored representations" Artif. Intell. vol. 121 49-107
[4]
Crites R. H., Barto A. G. (1998) "Elevator group control using multiple reinforcement learning agents" Machine Learning vol. 33 235-262
[5]
Dearden R., Boutilier C. (1997) "Abstraction and approximate decision-theoretic planning" Artif. Intell. vol. 89 219-283
[6]
Durfee E. H., Lesser V. R. (1989) "Negotiating task decomposition and allocation using partial global planning" Distributed Artificial Intelligence vol. 2 229-243
[7]
Givan R., Dean T., Greig M. (2003) "Equivalence notions and model minimization in Markov decision processes" Artif. Intell. vol. 147 163-223
[8]
Kaelbling L. P., Littman M. L., Moore A. W. (1996) "Reinforcement learning: A survey" J. Artif. Intell. Res. vol. 4 237-285
[9]
Robbins H., Monro S. (1951) "A stochastic approximation method" Ann. Math. Stat. vol. 22 400-407
[10]
Sathi A., Fox M. S. (1989) "Constraint-directed negotiation of resource reallocations" Distributed Artificial Intelligence vol. 2 163-193