共 40 条
[2]
Altman E., 1999, Constrained Markov Decision Processes
[3]
Ayesta Urtzi, 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton), P377
[7]
Bertsekas D.P., 2019, Reinforcement Learning and Optimal Control
[9]
Bertsekas DP., 2002, INTRO PROBABILITY, V1