50 items in total
[13]
IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT
[J].
MARKOV PROCESS AND CONTROL THEORY,
1989, 54
:158-163
[15]
THE MULTI-ARMED BANDIT PROBLEM WITH COVARIATES
[J].
ANNALS OF STATISTICS,
2013, 41 (02)
:693-721
[16]
The Multi-fidelity Multi-armed Bandit
[J].
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016),
2016, 29
[17]
Multi-armed Bandit with Additional Observations
[J].
Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States,
2018, (46)
:53-55
[18]
Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward
[J].
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021,
2021,
:3118-3124
[20]
CONTEXTUAL MULTI-ARMED BANDIT ALGORITHMS FOR PERSONALIZED LEARNING ACTION SELECTION
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP),
2017,
:6344-6348