共 108 条
[1]
Akkaya I, 2019, Arxiv, DOI arXiv:1910.07113
[3]
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2008, 38 (04)
:943-949
[4]
Amos Brandon, 2018, Advances in Neural Information Processing Systems, V31
[6]
Beckenbach L, 2020, 2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), P184
[7]
Beckenbach L, 2019, IEEE DECIS CONTR P, P7110, DOI 10.1109/CDC40024.2019.9030185
[8]
Beckenbach L, 2018, 2018 EUROPEAN CONTROL CONFERENCE (ECC), P1349, DOI 10.23919/ECC.2018.8550545
[9]
Addressing infinite-horizon optimization in MPC via Q-learning
[J].
IFAC PAPERSONLINE,
2018, 51 (20)
:60-65
[10]
Berkenkamp F., 2019, Safe exploration in reinforcement learning: Theory and applications in robotics