A Two-level Reinforcement Learning Algorithm for Ambiguous Mean-variance Portfolio Selection Problem

被引:0
|
作者
Huang, Xin [1 ]
Li, Duan [2 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020年
关键词
POLICY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional modeling on the mean-variance portfolio selection often assumes a full knowledge on statistics of assets' returns. It is, however, not always the case in real financial markets. This paper deals with an ambiguous mean-variance portfolio selection problem with a mixture model on the returns of risky assets, where the proportions of different component distributions are assumed to be unknown to the investor, but being constants (in any time instant). Taking into consideration the updates of proportions from future observations is essential to find an optimal policy with active learning feature, but makes the problem intractable when we adopt the classical methods. Using reinforcement learning, we derive an investment policy with a learning feature in a two-level framework. In the lower level, the time-decomposed approach (dynamic programming) is adopted to solve a family of scenario subcases where in each case the series of component distributions along multiple time periods is specified. At the upper level, a scenario-decomposed approach (progressive hedging algorithm) is applied in order to iteratively aggregate the scenario solutions from the lower layer based on the current knowledge on proportions, and this twolevel solution framework is repeated in a manner of rolling horizon. We carry out experimental studies to illustrate the execution of our policy scheme.
引用
收藏
页码:4527 / 4533
页数:7
相关论文
共 50 条
  • [1] THE GENERAL MEAN-VARIANCE PORTFOLIO SELECTION PROBLEM
    MARKOWITZ, HM
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1994, 347 (1684): : 543 - 549
  • [2] Continuous-time mean-variance portfolio selection: A reinforcement learning framework
    Wang, Haoran
    Zhou, Xun Yu
    MATHEMATICAL FINANCE, 2020, 30 (04) : 1273 - 1308
  • [3] ON MEAN-VARIANCE PORTFOLIO SELECTION
    SCHNABEL, JA
    MANAGERIAL AND DECISION ECONOMICS, 1984, 5 (01) : 3 - 6
  • [4] Two Possibilistic Mean-Variance Models for Portfolio Selection
    Chen, Wei
    FUZZY INFORMATION AND ENGINEERING, VOLUME 2, 2009, 62 : 1035 - 1044
  • [5] TD algorithm for the variance of return and mean-variance reinforcement learning
    Sato, Makoto
    Kimura, Hajime
    Kobayashi, Shibenobu
    Transactions of the Japanese Society for Artificial Intelligence, 2001, 16 (03) : 353 - 362
  • [6] Robust Markowitz mean-variance portfolio selection under ambiguous covariance matrix
    Ismail, Amine
    Pham, Huyen
    MATHEMATICAL FINANCE, 2019, 29 (01) : 174 - 207
  • [7] Optimal mean-variance portfolio selection
    Pedersen, Jesper Lund
    Peskir, Goran
    MATHEMATICS AND FINANCIAL ECONOMICS, 2017, 11 (02) : 137 - 160
  • [8] Behavioral mean-variance portfolio selection
    Bi, Junna
    Jin, Hanging
    Meng, Qingbin
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 271 (02) : 644 - 663
  • [9] Optimal mean-variance portfolio selection
    Jesper Lund Pedersen
    Goran Peskir
    Mathematics and Financial Economics, 2017, 11 : 137 - 160
  • [10] Robust mean-variance portfolio selection problem including fuzzy factors
    Hasuike, Takashi
    Ishii, Hiroaki
    IMECS 2008: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2008, : 1865 - 1870