共 34 条
- [1] [Anonymous], 2013, Artificial intelligence and statistics
- [2] [Anonymous], 2005, THEORY PRACTICE REVE, DOI DOI 10.1007/B139000
- [3] [Anonymous], 2015, Surveys in Operations Research and Management Science, DOI DOI 10.1016/J.SORMS.2015.03.001
- [4] [Anonymous], 2016, 29 C LEARN THEOR COL
- [5] [Anonymous], 2009, P C LEARN THEOR COLT
- [6] Dynamic Pricing for Nonperishable Products with Demand Learning [J]. OPERATIONS RESEARCH, 2009, 57 (05) : 1169 - 1188
- [7] Ashwinkumar B., 2014, PMLR, P1109
- [8] Finite-time analysis of the multiarmed bandit problem [J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
- [9] Aviv Y., 2012, The Oxford Handbook of Pricing Management, P522
- [10] Bandits with Knapsacks (Extended Abstract) [J]. 2013 IEEE 54TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2013, : 207 - 216