共 48 条
[1]
Achiam J, 2017, PR MACH LEARN RES, V70
[2]
Altman E., 1999, Constrained Markov Decision Processes, V7
[3]
Bertsekas D., 2012, Dynamic programming and optimal control, VI
[4]
Bertsekas D, 2016, NONLINEAR PROGRAMMIN, V4
[8]
Boyd SP., 2004, Convex optimization, DOI 10.1017/CBO9780511804441
[9]
Bu JJ, 2019, Arxiv, DOI arXiv:1907.08921
[10]
A Risk-Sensitive Finite-Time Reachability Approach for Safety of Stochastic Dynamic Systems
[J].
2019 AMERICAN CONTROL CONFERENCE (ACC),
2019,
:2958-2963