18 entries in total
- [1] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [3] Chang M. K., 2020, IEEE T GEOSCIENCE RE, V99, P1
- [4] Flammini M., 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS '18), P1353
- [5] Hipel K. W., 2018, IEEE T SYS, P1
- [6] Bandit based Monte-Carlo planning [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
- [8] Myerson R. B., 1977, Mathematics of Operations Research, V2, P225, DOI 10.1287/moor.2.3.225
- [9] Policiuc A. A., 2020, CORR