共 30 条
[1]
Aberdeen D., 2007, P 17 INT C AUT PLANN, P10
[3]
Baxter J., 1999, Direct gradient-based reinforcement learning: I. gradient estimation algorithms
[5]
BEASLEY JE, 2005, OR LIB
[7]
Blazewicz J., 1993, SCHEDULING COMPUTER
[8]
Boyan J.A., 1994, Advances in Neural Information Processing Systems, V6
[9]
Gabel T., 2008, P 7 INT C AUT AG MUL, P369
[10]
Gabel T., 2009, THESIS U OSNABRUCK G