19 entries in total
- [1] [Anonymous], 2012, COLT 2012 25 ANN C L
- [2] Auer P, 2003, SIAM J COMPUT, V32, P48, DOI 10.1137/S0097539701398375
- [4] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [6] Bouneffouf D, 2014, LECT NOTES COMPUT SC, V8836, P373, DOI 10.1007/978-3-319-12643-2_46
- [7] Chapelle O., 2011, ADV NEURAL INFORM PR, V24
- [8] A Neurocomputational Model for Cocaine Addiction [J]. NEURAL COMPUTATION, 2009, 21 (10) : 2869 - 2893
- [9] Beyond pain: modeling decision-making deficits in chronic pain [J]. FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2014, 8
- [10] By carrot or by stick: Cognitive reinforcement learning in Parkinsonism [J]. SCIENCE, 2004, 306 (5703) : 1940 - 1943