共 37 条
[21]
Lagoudakis MichailG., 2003, P 20 INT C MACHINE L, P424
[22]
Adaptive critic learning techniques for engine torque and air-fuel ratio control
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:988-993
[23]
Mahadevan S, 2007, J MACH LEARN RES, V8, P2169
[24]
Mannor S., 2003, Proceedings of the Twentieth International Conference on International Conference on Machine Learning, ICML'03, V20, P512
[25]
Approximate gradient methods in policy-space optimization of Markov reward processes
[J].
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS,
2003, 13 (1-2)
:111-148
[27]
Variable resolution discretization in optimal control
[J].
MACHINE LEARNING,
2002, 49 (2-3)
:291-323
[28]
Munos R, 2006, J MACH LEARN RES, V7, P771
[29]
Ng A. Y., 2000, P 16 C UNC ART INT, P406