共 18 条
[1]
[Anonymous], 2000, P 17 INT C MACH LEAR
[2]
[Anonymous], 2009, Stochastic Approximation: A Dynamical Systems Viewpoint
[3]
Bertsekas D. P., 2012, DYNAMIC PROGRAMMING, VII
[4]
Borkar V. S., 1998, SIAM J CONTROL OPTIM, V38, P662
[5]
A comprehensive survey of multiagent reinforcement learning
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2008, 38 (02)
:156-172
[6]
Chang Yu-han, 2003, Advances in Neural Information Processing Systems 17, NIPS'03, V16, P807
[8]
LITTMAN M, 1993, NEURAL NETW INNS, P45
[9]
Macua S. V., 2012, P IEEE INT WORKSH CO, P1
[10]
Pendrith M. D., 2000, Proceedings of the Fourth International Conference on Autonomous Agents, P404, DOI 10.1145/336595.337554