Markov decision processes under model uncertainty

被引:3
作者
Neufeld, Ariel [1 ,4 ]
Sester, Julian [2 ]
Sikic, Mario [3 ]
机构
[1] NTU Singapore, Div Math Sci, Singapore, Singapore
[2] Natl Univ Singapore, Dept Math, Singapore, Singapore
[3] Univ Zurich, Dept Banking & Finance, Zurich, Switzerland
[4] NTU Singapore, Div Math Sci, 21 Nanyang Link, Singapore 637371, Singapore
关键词
ambiguity; dynamic programming principle; Markov decision problem; portfolio optimization; ROBUST UTILITY MAXIMIZATION; PORTFOLIO OPTIMIZATION; OPTIMAL INVESTMENT; STRATEGIES; ALGORITHMS;
D O I
10.1111/mafi.12381
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
We introduce a general framework for Markov decision problems under model uncertainty in a discrete-time infinite horizon setting. By providing a dynamic programming principle, we obtain a local-to-global paradigm, namely solving a local, that is, a one time-step robust optimization problem leads to an optimizer of the global (i.e., infinite time-steps) robust stochastic optimal control problem, as well as to a corresponding worst-case measure. Moreover, we apply this framework to portfolio optimization involving data of the S&P500$S\&P\nobreakspace 500$. We present two different types of ambiguity sets; one is fully data-driven given by a Wasserstein-ball around the empirical measure, the second one is described by a parametric set of multivariate normal distributions, where the corresponding uncertainty sets of the parameters are estimated from the data. It turns out that in scenarios where the market is volatile or bearish, the optimal portfolio strategies from the corresponding robust optimization problem outperforms the ones without model uncertainty, showcasing the importance of taking model uncertainty into account.
引用
收藏
页码:618 / 665
页数:48
相关论文
共 73 条
  • [1] Swapping the nested fixed point algorithm: A class of estimators for discrete Markov decision models
    Aguirregabiria, V
    Mira, P
    [J]. ECONOMETRICA, 2002, 70 (04) : 1519 - 1543
  • [2] Aliprantis C. D., 2006, INFINITE DIMENSIONAL, V3rd
  • [3] Angiuli A., 2022, REINFORCEMENT LEARNI
  • [4] Angiuli A., 2021, REINFORCEMENT LEARNI
  • [5] Bäuerle N, 2011, UNIVERSITEXT, P1, DOI 10.1007/978-3-642-18324-9
  • [6] MDP algorithms for portfolio optimization problems in pure jump markets
    Baeuerle, Nicole
    Rieder, Ulrich
    [J]. FINANCE AND STOCHASTICS, 2009, 13 (04) : 591 - 611
  • [7] Duality theory for robust utility maximisation
    Bartl, Daniel
    Kupper, Michael
    Neufeld, Ariel
    [J]. FINANCE AND STOCHASTICS, 2021, 25 (03) : 469 - 503
  • [8] Robust expected utility maximization with medial limits
    Bartl, Daniel
    Cheridito, Patrick
    Kupper, Michael
    [J]. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2019, 471 (1-2) : 752 - 775
  • [9] EXPONENTIAL UTILITY MAXIMIZATION UNDER MODEL UNCERTAINTY FOR UNBOUNDED ENDOWMENTS
    Bartl, Daniel
    [J]. ANNALS OF APPLIED PROBABILITY, 2019, 29 (01) : 577 - 612
  • [10] Bauerle Nicole, 2021, Modern Trends in Controlled Stochastic Processes, P108