Risk-Averse Stochastic Convex Bandit

被引：0

作者：

Cardoso, Adrian Rivera ^{[1
]}

Xu, Huan ^{[1
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Motivated by applications in clinical trials and finance, we study the problem of online convex optimization (with bandit feedback) where the decision maker is risk-averse. We provide two algorithms to solve this problem. The first one is a descent-type algorithm which is easy to implement. The second algorithm, which combines the ellipsoid method and a center point device, achieves (almost) optimal regret bounds with respect to the number of rounds. To the best of our knowledge this is the first attempt to address risk-aversion in the online convex bandit problem.

引用

页码：39 / 47

页数：9

共 50 条

[21] A Note on Stability for Risk-Averse Stochastic Complementarity Problems
Burtscheidt, Johanna
Claus, Matthias
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2017, 172 (01) : 298 - 308
[22] Robust multicriteria risk-averse stochastic programming models
Liu, Xiao
Kucukyavuz, Simge
Noyan, Nilay
ANNALS OF OPERATIONS RESEARCH, 2017, 259 (1-2) : 259 - 294
[23] Risk-Averse Regret Minimization in Multistage Stochastic Programs
Poursoltani, Mehran
Delage, Erick
Georghiou, Angelos
OPERATIONS RESEARCH, 2024, 72 (04) : 1727 - 1738
[24] Multilevel Optimization Modeling for Risk-Averse Stochastic Programming
Eckstein, Jonathan
Eskandani, Deniz
Fan, Jingnan
INFORMS JOURNAL ON COMPUTING, 2016, 28 (01) : 112 - 128
[25] Dual SDDP for risk-averse multistage stochastic programs
da Costa, Bernardo Freitas Paulo
Leclere, Vincent
OPERATIONS RESEARCH LETTERS, 2023, 51 (03) : 332 - 337
[26] On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse
Matthias Claus
Rüdiger Schultz
Kai Spürkel
Tobias Wollenberg
Vietnam Journal of Mathematics, 2019, 47 : 865 - 879
[27] On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse
Claus, Matthias
Schultz, Ruediger
Spuerkel, Kai
Wollenberg, Tobias
VIETNAM JOURNAL OF MATHEMATICS, 2019, 47 (04) : 865 - 879
[28] Risk-Averse No-Regret Learning in Online Convex Games
Wang, Zifan
Shen, Yi
Zavlanos, Michael M.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[29] Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
Lin, Yifan
Wang, Yuhao
Zhou, Enlu
JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2022,
[30] Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
Lin, Yifan
Wang, Yuhao
Zhou, Enlu
JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2023, 32 (03) : 267 - 288

← 1 2 3 4 5 →