A Two-level Reinforcement Learning Algorithm for Ambiguous Mean-variance Portfolio Selection Problem

Cited by: 0

Authors:

Huang, Xin [1]

Li, Duan [2]

Affiliations:

[1] Chinese University of Hong Kong, Department of Systems Engineering and Engineering Management, Hong Kong, People's Republic of China

[2] City University of Hong Kong, School of Data Science, Hong Kong, People's Republic of China

Source:

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence | 2020

Keywords:

POLICY;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Traditional models of mean-variance portfolio selection often assume full knowledge of the statistics of asset returns. This is, however, not always the case in real financial markets. This paper deals with an ambiguous mean-variance portfolio selection problem with a mixture model on the returns of risky assets, where the proportions of the component distributions are unknown to the investor but constant over time. Taking into account the updates of the proportions from future observations is essential for finding an optimal policy with an active learning feature, but it makes the problem intractable under classical methods. Using reinforcement learning, we derive an investment policy with a learning feature in a two-level framework. At the lower level, a time-decomposed approach (dynamic programming) is adopted to solve a family of scenario subcases, where each case specifies the series of component distributions along multiple time periods. At the upper level, a scenario-decomposed approach (the progressive hedging algorithm) is applied to iteratively aggregate the scenario solutions from the lower level based on the current knowledge of the proportions, and this two-level solution framework is repeated in a rolling-horizon manner. We carry out experimental studies to illustrate the execution of our policy scheme.
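The scenario-aggregation step named in the abstract can be illustrated with a minimal sketch of progressive hedging on a toy problem. This is not the paper's portfolio model: it assumes each scenario s has a scalar quadratic cost 0.5 * (x - a_s)^2, a single shared decision x, and known scenario weights p_s standing in for the current estimate of the mixture proportions. The function name `progressive_hedging` and the parameters `targets`, `probs`, `rho`, and `iters` are hypothetical and chosen only for illustration.

```python
def progressive_hedging(targets, probs, rho=1.0, iters=100):
    """Toy progressive hedging (illustrative assumption, not the paper's model).

    Minimises sum_s p_s * 0.5 * (x - a_s)**2 over one shared decision x by
    solving each scenario separately and pulling the solutions to consensus.
    """
    n = len(targets)
    w = [0.0] * n      # multipliers on the non-anticipativity constraint x_s = x_bar
    x_bar = 0.0        # current consensus (aggregated) decision
    x = [0.0] * n      # per-scenario decisions
    for _ in range(iters):
        # Scenario subproblem in closed form:
        #   argmin_x 0.5*(x - a)**2 + w*x + 0.5*rho*(x - x_bar)**2
        x = [(a - w_s + rho * x_bar) / (1.0 + rho)
             for a, w_s in zip(targets, w)]
        # Aggregate scenario solutions with the current scenario weights.
        x_bar = sum(p * x_s for p, x_s in zip(probs, x))
        # Penalise each scenario's deviation from the consensus.
        w = [w_s + rho * (x_s - x_bar) for w_s, x_s in zip(w, x)]
    return x_bar, x


if __name__ == "__main__":
    # Hypothetical scenario targets and weights, for illustration only.
    consensus, per_scenario = progressive_hedging(
        targets=[0.02, 0.05, -0.01], probs=[0.5, 0.3, 0.2]
    )
    print(consensus, per_scenario)
```

For this toy cost the consensus decision converges to the weight-averaged target, so it is easy to check by hand. In a rolling-horizon scheme of the kind the abstract describes, the weights would presumably be re-estimated after each new observation and the aggregation rerun.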

Pages: 4527-4533

Page count: 7
