30 entries in total
[1]
[Anonymous], 2007, DYNAMIC PROGRAMMING
[2]
[Anonymous], 1996, LIDSP2349 MIT
[3]
[Anonymous], 2003, ITERATIVE METHODS SP, DOI 10.1137/1.9780898718003
[4]
BARTO AG, 1994, ADV NEURAL INFORMATI, V6, P687
[5]
BASU A, 2006, 200625 IND I SCI DEP
[6]
BERTSEKAS D, 2004, LEARNING APPROXIMATE
[7]
Bertsekas Dimitri, 1996, Neuro dynamic programming
[8]
BERTSEKAS DP, 2007, LIDS2754 MIT
[9]
Boyan J., 2002, MACH LEARN, V49, P1
[10]
Bradtke SJ, 1996, MACH LEARN, V22, P33, DOI 10.1007/BF00114723