32 entries in total
[21]
Marivate V., 2013, P WORKSH 27 AAAI C A
[22]
Michie D., 1968, Machine intelligence, P137
[23]
Ng AY, 1999, MACHINE LEARNING, PROCEEDINGS, P278
[24]
Puterman M., 2009, MARKOV DECISION PROC, V414
[26]
Singh SP, 1996, MACH LEARN, V22, P123, DOI 10.1007/BF00114726
[28]
Taylor ME, 2009, J MACH LEARN RES, V10, P1633
[29]
ASYNCHRONOUS STOCHASTIC-APPROXIMATION AND Q-LEARNING [J]. MACHINE LEARNING, 1994, 16(03): 185-202
[30]
van Hasselt H. P., 2011, THESIS