50 entries in total
[1]
Multiagent reinforcement learning using function approximation [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04): 485-497
[2]
[Anonymous], 1999, Learning in Graphical Models
[3]
[Anonymous], 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems-Volume
[4]
Barto AG, 2003, DISCRETE EVENT DYN S, V13, P343
[5]
Bertsekas D. P., 1987, DYNAMIC PROGRAMMING
[6]
Bertsekas DP, 2012, DYNAMIC PROGRAMMING, V2
[8]
Chalkiadakis G., 2003, Autonomous Agents and Multiagent Systems, P709, DOI 10.1145/860575.860689
[9]
Watkins C. J. C. H., 1989, Learning from delayed rewards
[10]
Claus C, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P746