共 18 条
- [11] Russel SJ., 2010, Artificial intelligence: a modern approach, V3rd
- [12] Strekalovsky A.S., 2003, Elementy nevypukloi optimizatsii (Elements of Nonconvex Optimization)
- [13] Sutton RS., 1998, INTRO REINFORCEMENT, DOI [10.1109/TNN.1998.712192, DOI 10.1109/TNN.1998.712192]
- [14] van Hasselt H, 2016, AAAI CONF ARTIF INTE, P2094
- [15] Wang C., 2022, P INT C LEARN REPR I
- [16] Wasserman P., 1989, Neural Computing Theory and Practice
- [17] WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698
- [18] Wiering M, 2012, ADAPT LEARN OPTIM, V12, P1, DOI 10.1007/978-3-642-27645-3