29 entries in total
[1]
[Anonymous], P IEEE RSJ INT C INT
[2]
Bellman R., 1965, DYNAMIC PROGRAMMING, V81
[3]
Bhanu B, 2001, IEEE INT CONF ROBOT, P491, DOI 10.1109/ROBOT.2001.932598
[4]
BROADBENT R, 2005, P 2005 IEEE INT C RO
[5]
Dahmani Y., 2005, Journal of Computer Sciences, V1, P28, DOI 10.3844/jcssp.2005.28.30
[6]
EDAN Y, 2004, C ADV INT TECHN APPL
[7]
Glorennec P.Y., 2000, P EUR S INT TECHN ES, P14
[8]
A new Q-learning algorithm based on the Metropolis criterion [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34(05): 2140-2143
[9]
HOWARD A, 1999, THESIS U MELBOURNE D
[10]
Reinforcement learning: A survey [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4: 237-285