Curriculum learning empowered reinforcement learning for graph-based portfolio management: Performance optimization and comprehensive analysis

被引:3
作者
Salamai, Abdullah Ali [1 ]
机构
[1] Jazan Univ, Appl Coll, Dept Management, Jazan, Saudi Arabia
关键词
Portfolio management; Curriculum learning; Transformer network; Graph neural networks; Deep reinforcement learning; PREDICTION;
D O I
10.1016/j.neunet.2024.106537
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Portfolio management (PM) is a popular financial process that concerns the occasional reallocation of a particular quantity of capital into a portfolio of assets, with the main aim of maximizing profitability conditioned to a certain level of risk. Given the inherent dynamicity of stock exchanges and development for long-term performance, reinforcement learning (RL) has become a dominating solution for solving the problem of portfolio management in an automated and efficient manner. Nevertheless, the present RL-based PM methods just take into account the variations in prices of portfolio assets and the implications of price variations, while overlooking the significant relationships among different assets in the market, which are extremely valuable for managerial decisions. To close this gap, this paper introduces a novel deep model that combines two subnetworks; one to learn a temporal representation of historical prices using a refined temporal learner, while the other learns the relationships between different stocks in the market using a relation graph learner (RGL). Then, the above learners are integrated into the curriculum RL scheme for formulating the PM as a curriculum Markov Decision Process, in which an adaptive curriculum policy is presented to enable the agent to adaptively minimize risk value and maximize cumulative return. Proof-of-concept experiments are performed on data from three public stock indices (namely S&P500, NYSE, and NASDAQ), and the results demonstrate the efficiency of the proposed framework in improving the portfolio management performance over the competing RL solutions.
引用
收藏
页数:19
相关论文
共 53 条
[1]   Continuous control with Stacked Deep Dynamic Recurrent Reinforcement Learning for portfolio optimization [J].
Aboussalah, Amine Mohamed ;
Lee, Chi-Guhn .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
[2]  
Alam R., 2020, EMERGING TECHNOLOGIE, V3, P214
[3]   A Survey on Deep Reinforcement Learning for Data Processing and Analytics [J].
Cai, Qingpeng ;
Cui, Can ;
Xiong, Yiyuan ;
Wang, Wei ;
Xie, Zhongle ;
Zhang, Meihui .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) :4446-4465
[4]   Reinforcement Learning in Economics and Finance [J].
Charpentier, Arthur ;
Elie, Romuald ;
Remlinger, Carl .
COMPUTATIONAL ECONOMICS, 2023, 62 (01) :425-462
[5]   A novel graph convolutional feature based convolutional neural network for stock trend prediction [J].
Chen, Wei ;
Jiang, Manrui ;
Zhang, Wei-Guo ;
Chen, Zhensong .
INFORMATION SCIENCES, 2021, 556 :67-94
[6]   Mean-variance portfolio optimization using machine learning-based stock price prediction [J].
Chen, Wei ;
Zhang, Haoyu ;
Mehlawat, Mukesh Kumar ;
Jia, Lifen .
APPLIED SOFT COMPUTING, 2021, 100
[7]   Financial time series forecasting with multi-modality graph neural network [J].
Cheng, Dawei ;
Yang, Fangzhou ;
Xiang, Sheng ;
Liu, Jin .
PATTERN RECOGNITION, 2022, 121
[8]   Universal portfolios with side information [J].
Cover, TM ;
Ordentlich, E .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (02) :348-363
[9]   Temporal Relational Ranking for Stock Prediction [J].
Feng, Fuli ;
He, Xiangnan ;
Wang, Xiang ;
Luo, Cheng ;
Liu, Yiqun ;
Chua, Tat-Seng .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2019, 37 (02)
[10]   DELAFO: An Efficient Portfolio Optimization Using Deep Neural Networks [J].
Hieu K Cao ;
Han K Cao ;
Binh T Nguyen .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 :623-635