共 23 条
[1]
Abbasi-Yadkori Y, 2014, PR MACH LEARN RES, V32, P496
[2]
[Anonymous], 2007, Approximate dynamic programming: Solving the curses of dimensionality
[3]
Bertsekas D. P., 1995, Dynamic programming and optimal control, V1-2
[4]
Bertsekas D, 2020, Arxiv, DOI arXiv:1910.00120
[5]
Multiagent value iteration algorithms in dynamic programming and reinforcement learning
[J].
RESULTS IN CONTROL AND OPTIMIZATION,
2020, 1
[7]
Bowling M., 2000, Technical report
[8]
Campbell T, 2013, P AMER CONTR CONF, P2356
[9]
Chen Ziyi, 2022, P MACHINE LEARNING R