共 27 条
- [3] Bertsekas DP, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P560, DOI 10.1109/CDC.1995.478953
- [5] Dunn W L., 2022, Exploring Monte Carlo Methods
- [6] Fortunato M, 2019, Arxiv, DOI arXiv:1706.10295
- [8] Howard R. A., 1960, Dynamic programming and markov processes
- [10] Bandit based Monte-Carlo planning [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293