Markov decision processes under model uncertainty

被引：3

作者：

Neufeld, Ariel ^{[1
,4
]}

Sester, Julian ^{[2
]}

Sikic, Mario ^{[3
]}

机构：

[1] NTU Singapore, Div Math Sci, Singapore, Singapore

[2] Natl Univ Singapore, Dept Math, Singapore, Singapore

[3] Univ Zurich, Dept Banking & Finance, Zurich, Switzerland

[4] NTU Singapore, Div Math Sci, 21 Nanyang Link, Singapore 637371, Singapore

来源：

MATHEMATICAL FINANCE | 2023年 / 33卷 / 03期

关键词：

ambiguity; dynamic programming principle; Markov decision problem; portfolio optimization; ROBUST UTILITY MAXIMIZATION; PORTFOLIO OPTIMIZATION; OPTIMAL INVESTMENT; STRATEGIES; ALGORITHMS;

D O I：

10.1111/mafi.12381

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

We introduce a general framework for Markov decision problems under model uncertainty in a discrete-time infinite horizon setting. By providing a dynamic programming principle, we obtain a local-to-global paradigm, namely solving a local, that is, a one time-step robust optimization problem leads to an optimizer of the global (i.e., infinite time-steps) robust stochastic optimal control problem, as well as to a corresponding worst-case measure. Moreover, we apply this framework to portfolio optimization involving data of the S&P500$S\&P\nobreakspace 500$. We present two different types of ambiguity sets; one is fully data-driven given by a Wasserstein-ball around the empirical measure, the second one is described by a parametric set of multivariate normal distributions, where the corresponding uncertainty sets of the parameters are estimated from the data. It turns out that in scenarios where the market is volatile or bearish, the optimal portfolio strategies from the corresponding robust optimization problem outperforms the ones without model uncertainty, showcasing the importance of taking model uncertainty into account.

引用

页码：618 / 665

页数：48

共 73 条

[1] Swapping the nested fixed point algorithm: A class of estimators for discrete Markov decision models
Aguirregabiria, V
Mira, P
[J]. ECONOMETRICA, 2002, 70 (04) : 1519 - 1543
[2] Aliprantis C. D., 2006, INFINITE DIMENSIONAL, V3rd
[3] Angiuli A., 2022, REINFORCEMENT LEARNI
[4] Angiuli A., 2021, REINFORCEMENT LEARNI
[5] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
[6] MDP algorithms for portfolio optimization problems in pure jump markets
Baeuerle, Nicole
Rieder, Ulrich
[J]. FINANCE AND STOCHASTICS, 2009, 13 (04) : 591 - 611
[7] Duality theory for robust utility maximisation
Bartl, Daniel
Kupper, Michael
Neufeld, Ariel
[J]. FINANCE AND STOCHASTICS, 2021, 25 (03) : 469 - 503
[8] Robust expected utility maximization with medial limits
Bartl, Daniel
Cheridito, Patrick
Kupper, Michael
[J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 471 (1-2) : 752 - 775
[9] EXPONENTIAL UTILITY MAXIMIZATION UNDER MODEL UNCERTAINTY FOR UNBOUNDED ENDOWMENTS
Bartl, Daniel
[J]. ANNALS OF APPLIED PROBABILITY, 2019, 29 (01) : 577 - 612
[10] Bauerle Nicole, 2021, Modern Trends in Controlled Stochastic Processes, P108

← 1 2 3 4 5 6 7 8 →