10 entries in total
- [1] [Anonymous], 1999, MOMUC99
- [2] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [3] Cortical substrates for exploratory decisions in humans [J]. NATURE, 2006, 441 (7095) : 876 - 879
- [5] Efficient decision-making by volume-conserving physical object [J]. NEW JOURNAL OF PHYSICS, 2015, 17
- [6] Amoeba-inspired algorithm for cognitive medium access [J]. IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2014, 5 (02) : 198 - 209
- [8] Medium Access in Cognitive Radio Networks: A Competitive Multi-armed Bandit Framework [J]. 2008 42ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-4, 2008 : 98 - +
- [10] Sutton R.S., 2017, REINFORCEMENT LEARNI, V2, DOI 10.1093/cercor/bhw013