Risk-Averse Stochastic Convex Bandit

Cited by: 0
Authors
Cardoso, Adrian Rivera [1]
Xu, Huan [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Source
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019 / Vol. 89
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Motivated by applications in clinical trials and finance, we study the problem of online convex optimization (with bandit feedback) where the decision maker is risk-averse. We provide two algorithms to solve this problem. The first one is a descent-type algorithm which is easy to implement. The second algorithm, which combines the ellipsoid method and a center point device, achieves (almost) optimal regret bounds with respect to the number of rounds. To the best of our knowledge, this is the first attempt to address risk-aversion in the online convex bandit problem.
Pages: 39 - 47
Number of pages: 9
Related papers
50 in total
  • [21] A Note on Stability for Risk-Averse Stochastic Complementarity Problems
    Burtscheidt, Johanna
    Claus, Matthias
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2017, 172 (01) : 298 - 308
  • [22] Robust multicriteria risk-averse stochastic programming models
    Liu, Xiao
    Kucukyavuz, Simge
    Noyan, Nilay
    ANNALS OF OPERATIONS RESEARCH, 2017, 259 (1-2) : 259 - 294
  • [23] Risk-Averse Regret Minimization in Multistage Stochastic Programs
    Poursoltani, Mehran
    Delage, Erick
    Georghiou, Angelos
    OPERATIONS RESEARCH, 2024, 72 (04) : 1727 - 1738
  • [24] Multilevel Optimization Modeling for Risk-Averse Stochastic Programming
    Eckstein, Jonathan
    Eskandani, Deniz
    Fan, Jingnan
    INFORMS JOURNAL ON COMPUTING, 2016, 28 (01) : 112 - 128
  • [25] Dual SDDP for risk-averse multistage stochastic programs
    da Costa, Bernardo Freitas Paulo
    Leclere, Vincent
    OPERATIONS RESEARCH LETTERS, 2023, 51 (03) : 332 - 337
  • [26] On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse
    Matthias Claus
    Rüdiger Schultz
    Kai Spürkel
    Tobias Wollenberg
    Vietnam Journal of Mathematics, 2019, 47 : 865 - 879
  • [27] On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse
    Claus, Matthias
    Schultz, Ruediger
    Spuerkel, Kai
    Wollenberg, Tobias
    VIETNAM JOURNAL OF MATHEMATICS, 2019, 47 (04) : 865 - 879
  • [28] Risk-Averse No-Regret Learning in Online Convex Games
    Wang, Zifan
    Shen, Yi
    Zavlanos, Michael M.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [29] Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
    Lin, Yifan
    Wang, Yuhao
    Zhou, Enlu
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2022
  • [30] Risk-averse Contextual Multi-armed Bandit Problem with Linear Payoffs
    Lin, Yifan
    Wang, Yuhao
    Zhou, Enlu
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2023, 32 (03) : 267 - 288