Multi-period portfolio optimization using a deep reinforcement learning hyper-heuristic approach

被引:23
作者
Cui, Tianxiang [1 ,2 ]
Du, Nanjiang [1 ,2 ]
Yang, Xiaoying [1 ,2 ]
Ding, Shusheng [3 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, 199 Taikang East Rd, Ningbo 315100, Zhejiang, Peoples R China
[2] Ningbo Key Lab Digital Port Technol, 199 Taikang East Rd, Ningbo 315100, Zhejiang, Peoples R China
[3] Ningbo Univ, Sch Business, 818 Fenghua Rd, Ningbo 315211, Zhejiang, Peoples R China
关键词
Portfolio optimization; Deep reinforcement learning; Hyper-heuristic; Decision making; Uncertainty; SELECTION; MARKET; HYPOTHESIS; STRATEGIES; ALGORITHM; RISK; GAME; GO;
D O I
10.1016/j.techfore.2023.122944
中图分类号
F [经济];
学科分类号
02 ;
摘要
Portfolio optimization concerns with periodically allocating the limited funds to invest in a variety of potential assets in order to satisfy investors' appetites for risk and return goals. Recently, Deep Reinforcement Learning (DRL) has shown its promising capabilities in sequential decision making problems. However, traditional DRL algorithms directly operate in the space of low-level actions, which exhibits poor scalability and becomes intractable in real-world problem instances when the dimensionality of the environment increases. To deal with this, in this work, a novel DRL hyper-heuristic framework is proposed for multi-period portfolio optimization problem. Instead of exploiting the entire action domain, our proposed approach is more effective by searching for low-level well-developed trading strategies. In addition, our proposed approach is data driven and respects the nature of the problem by taking advantage of expert domain knowledge and posing it multidimensional states to further leverage additional diverse information from alternative views of the environment. The proposed approach is evaluated on five real-world capital market problem instances and numerous experimental results demonstrate our proposed method can achieve notable performance gains compared to state-of-art trading strategies as well as traditional DRL baseline method. The data we used are from five stock indices, covering the period from the 2012 to 2022. Our study can have salient policy implications for investment strategy formulation and effective regulatory frameworks establishment.
引用
收藏
页数:11
相关论文
共 78 条
[1]   Deep learning-based exchange rate prediction during the COVID-19 pandemic [J].
Abedin, Mohammad Zoynul ;
Moon, Mahmudul Hasan ;
Hassan, M. Kabir ;
Hajek, Petr .
ANNALS OF OPERATIONS RESEARCH, 2021, 345 (2) :1335-1386
[2]   Solving urban transit route design problem using selection hyper-heuristics [J].
Ahmed, Leena ;
Mumford, Christine ;
Kheiri, Ahmed .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 274 (02) :545-559
[3]   An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown [J].
Almahdi, Saud ;
Yang, Steve Y. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 87 :267-279
[4]   Stock return predictability: Is it there? [J].
Ang, Andrew ;
Bekaert, Geert .
REVIEW OF FINANCIAL STUDIES, 2007, 20 (03) :651-707
[5]  
Appel Gerald., 2005, TECHNICAL ANAL POWER
[6]   Stock return predictability and model uncertainty [J].
Avramov, D .
JOURNAL OF FINANCIAL ECONOMICS, 2002, 64 (03) :423-458
[7]  
BEASLEY JE, 1990, J OPER RES SOC, V41, P1069, DOI 10.2307/2582903
[8]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[9]   Algorithm for cardinality-constrained quadratic optimization [J].
Bertsimas, Dimitris ;
Shioda, Romy .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2009, 43 (01) :1-22
[10]   Estimation of the global minimum variance portfolio in high dimensions [J].
Bodnar, Taras ;
Parolya, Nestor ;
Schmid, Wolfgang .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 266 (01) :371-390