Risk-Averse Stochastic Convex Bandit

被引：0

作者：

Cardoso, Adrian Rivera ^{[1
]}

Xu, Huan ^{[1
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Motivated by applications in clinical trials and finance, we study the problem of online convex optimization (with bandit feedback) where the decision maker is risk-averse. We provide two algorithms to solve this problem. The first one is a descent-type algorithm which is easy to implement. The second algorithm, which combines the ellipsoid method and a center point device, achieves (almost) optimal regret bounds with respect to the number of rounds. To the best of our knowledge this is the first attempt to address risk-aversion in the online convex bandit problem.

引用

页码：39 / 47

页数：9

共 50 条

[41] Robust Risk-Averse Stochastic Multi-armed Bandits
Maillard, Odalric-Ambrym
ALGORITHMIC LEARNING THEORY (ALT 2013), 2013, 8139 : 218 - 233
[42] An approximation scheme for a class of risk-averse stochastic equilibrium problems
Luna, Juan Pablo
Sagastizabal, Claudia
Solodov, Mikhail
MATHEMATICAL PROGRAMMING, 2016, 157 (02) : 451 - 481
[43] ARE BANKS RISK-AVERSE?
Nishiyama, Yasuo
EASTERN ECONOMIC JOURNAL, 2007, 33 (04) : 471 - 490
[44] Robust stochastic dominance and its application to risk-averse optimization
Dentcheva, Darinka
Ruszczynski, Andrzej
MATHEMATICAL PROGRAMMING, 2010, 123 (01) : 85 - 100
[45] A Risk-Averse Newsvendor Model under Stochastic Market Price
Zhang, Huirong
Zhang, Zhenyu
Zhang, Jiaping
DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021
[46] Scenario decomposition of risk-averse multistage stochastic programming problems
Ricardo A. Collado
Dávid Papp
Andrzej Ruszczyński
Annals of Operations Research, 2012, 200 : 147 - 170
[47] Stochastic generalized standard materials and risk-averse effective behavior
Bleyer, Jeremy
JOURNAL OF THE MECHANICS AND PHYSICS OF SOLIDS, 2025, 195
[48] Strong convexity in risk-averse stochastic programs with complete recourse
Claus, Matthias
Schultz, Ruediger
Spuerkel, Kai
COMPUTATIONAL MANAGEMENT SCIENCE, 2018, 15 (3-4) : 411 - 429
[49] Robust stochastic dominance and its application to risk-averse optimization
Darinka Dentcheva
Andrzej Ruszczyński
Mathematical Programming, 2010, 123 : 85 - 100
[50] Risk-Averse Equilibria for Vehicle Navigation in Stochastic Congestion Games
Yekkehkhany, Ali
Nagi, Rakesh
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 18719 - 18735

← 1 2 3 4 5 →