共 30 条
[11]
Bertsekas DP., 2016, Nonlinear Programming, V3
[12]
Bertsekas DP, 2020, IEEE/CAA J Autom Sin
[13]
Bertsekas DP, 2010, Williams-baird counterexample for Q-factor asynchronous policy iteration
[16]
TEAM DECISION-THEORY AND INFORMATION STRUCTURES
[J].
PROCEEDINGS OF THE IEEE,
1980, 68 (06)
:644-654
[19]
Li YY, 2020, Arxiv, DOI arXiv:1912.09135