共 48 条
- [11] Chang HS, 2013, COMMUN CONTROL ENG, P1, DOI 10.1007/978-1-4471-5022-0
- [12] DAI J. G., 2022, Stochastic Systems, V12, P30, DOI 10.1287/STSY.2021. 0081
- [13] Q-Learning With Uniformly Bounded Variance [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (11) : 5948 - 5963
- [14] Dewanto V, 2021, Arxiv, DOI arXiv:2010.08920
- [15] Durrett R., 2019, Probability: Theory and Examples, Vfifth
- [18] Glasserman P., 2004, Monte Carlo methods in financial engineering, V53
- [20] Hu J., A Q-learning algorithm for Markov decision processes with continuous state spaces