共 39 条
[1]
[Anonymous], 1989, LEARNING DELAYED REW
[2]
[Anonymous], 1994, Machine Learning, DOI DOI 10.1016/C2009-0-27542-8
[3]
Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics
[4]
Bernard A, 2005, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2005, P459
[5]
Bertsekas D. P., 1976, DYNAMIC PROGRAMMING
[8]
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f