Wealth Flow Model: Online Portfolio Selection Based on Learning Wealth Flow Matrices

被引:5
作者
Yin, Jianfei [1 ,2 ]
Wang, Ruili [3 ]
Guo, Yeqing [4 ]
Bai, Yizhe [5 ]
Ju, Shunda [5 ]
Liu, Weili [5 ]
Huang, Joshua Zhexue [5 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518060, Peoples R China
[3] Massey Univ, Sch Nat & Computat Sci, Auckland 102904, New Zealand
[4] Tisson Regaltc Commun Technol, Guangzhou 510623, Peoples R China
[5] Shenzhen Univ, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Online portfolio selection; wealth flow matrix; deep reinforcement learning regret bound; UNIVERSAL PORTFOLIOS; LATENT STRUCTURE; OPTIMIZATION; STRATEGIES; REVERSION;
D O I
10.1145/3464308
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes a deep learning solution to the online portfolio selection problem based on learning a latent structure directly from a price time series. It introduces a novel wealth flow matrix for representing a latent structure that has special regular conditions to encode the knowledge about the relative strengths of assets in portfolios. Therefore, a wealth flow model (WFM) is proposed to learn wealth flow matrices and maximize portfolio wealth simultaneously. Compared with existing approaches, our work has several distinctive benefits: (1) the learning of wealth flow matrices makes our model more generalizable than models that only predict wealth proportion vectors, and (2) the exploitation of wealth flow matrices and the exploration of wealth growth are integrated into our deep reinforcement algorithm for the WFM. These benefits, in combination, lead to a highly-effective approach for generating reasonable investment behavior, including short-term trend following, the following of a few losers, no self-investment, and sparse portfolios. Extensive experiments on five benchmark datasets from real-world stock markets confirm the theoretical advantage of the WFM, which achieves the Pareto improvements in terms of multiple performance indicators and the steady growth of wealth over the state-of-the-art algorithms.
引用
收藏
页数:27
相关论文
共 67 条
  • [51] Ortner R., 2019, PR MACH LEARN RES, P81
  • [52] Revisiting Maximum-A-Posteriori Estimation in Log-Concave Models
    Pereyra, Marcelo
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2019, 12 (01) : 650 - 670
  • [53] Kullback-Leibler Divergence Estimation of Continuous Distributions
    Perez-Cruz, Fernando
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS, VOLS 1-6, 2008, : 1666 - 1670
  • [54] Machine behaviour
    Rahwan, Iyad
    Cebrian, Manuel
    Obradovich, Nick
    Bongard, Josh
    Bonnefon, Jean-Francois
    Breazeal, Cynthia
    Crandall, Jacob W.
    Christakis, Nicholas A.
    Couzin, Iain D.
    Jackson, Matthew O.
    Jennings, Nicholas R.
    Kamar, Ece
    Kloumann, Isabel M.
    Larochelle, Hugo
    Lazer, David
    McElreath, Richard
    Mislove, Alan
    Parkes, David C.
    Pentland, Alex 'Sandy'
    Roberts, Margaret E.
    Shariff, Azim
    Tenenbaum, Joshua B.
    Wellman, Michael
    [J]. NATURE, 2019, 568 (7753) : 477 - 486
  • [55] Mean reversion in the US stock market
    Serletis, Apostolos
    Rosenberg, Aryeh Adam
    [J]. CHAOS SOLITONS & FRACTALS, 2009, 40 (04) : 2007 - 2015
  • [56] Temporal pattern attention for multivariate time series forecasting
    Shih, Shun-Yao
    Sun, Fan-Keng
    Lee, Hung-yi
    [J]. MACHINE LEARNING, 2019, 108 (8-9) : 1421 - 1441
  • [57] Mastering the game of Go without human knowledge
    Silver, David
    Schrittwieser, Julian
    Simonyan, Karen
    Antonoglou, Ioannis
    Huang, Aja
    Guez, Arthur
    Hubert, Thomas
    Baker, Lucas
    Lai, Matthew
    Bolton, Adrian
    Chen, Yutian
    Lillicrap, Timothy
    Hui, Fan
    Sifre, Laurent
    van den Driessche, George
    Graepel, Thore
    Hassabis, Demis
    [J]. NATURE, 2017, 550 (7676) : 354 - +
  • [58] van Hasselt H, 2016, AAAI CONF ARTIF INTE, P2094
  • [59] Vaswani A, 2017, ADV NEUR IN, V30
  • [60] Xing Frank, 2020, P 28 INT C COMP LING, P978