共 32 条
[31]
Auer P., Cesa-Bianchi N., Fischer P., Finite-time analysis of the multiarmed bandit problem, Machine Learn, 47, pp. 235-256, (2002)
[32]
Kim S.-J., Aono M., Nameda E., Efficient decision-making by volume-conserving physical object, New J Phys, 17, 8, (2015)