共 40 条
[2]
[Anonymous], 2017, DYNAMIC PROGRAMMING
[3]
[Anonymous], 2007, AITR07339 U TEX AUST
[4]
ANTOS A., 2007, Adv. Neural Inf. Process. Syst., V20
[5]
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
[J].
LEARNING THEORY, PROCEEDINGS,
2006, 4005
:574-588
[6]
Armesto L, YOUTUBE ROBOTICS SYS
[7]
Armesto L, 2021, MEARM ROBOT UPV VERS
[8]
Volume-weighted Bellman error method for adaptive meshing in approximate dynamic programming
[J].
REVISTA IBEROAMERICANA DE AUTOMATICA E INFORMATICA INDUSTRIAL,
2022, 19 (01)
:37-47
[9]
Bertsekas D. P., 2018, ABSTRACT DYNAMIC PRO
[10]
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f