Applications of Markov Decision Process Model and Deep Learning in Quantitative Portfolio Management during the COVID-19 Pandemic

被引:5
作者
Yue, Han [1 ]
Liu, Jiapeng [1 ]
Zhang, Qin [1 ]
机构
[1] China Jiliang Univ, Coll Econ & Management, Hangzhou 310018, Peoples R China
关键词
Markov decision process model; quantitative portfolio management; deep reinforcement learning; deep learning; omega ratio; TRADING SYSTEM; REINFORCEMENT; DIMENSIONALITY; STRATEGIES; RISK;
D O I
10.3390/systems10050146
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Whether for institutional investors or individual investors, there is an urgent need to explore autonomous models that can adapt to the non-stationary, low-signal-to-noise markets. This research aims to explore the two unique challenges in quantitative portfolio management: (1) the difficulty of representation and (2) the complexity of environments. In this research, we suggest a Markov decision process model-based deep reinforcement learning model including deep learning methods to perform strategy optimization, called SwanTrader. To achieve better decisions of the portfolio-management process from two different perspectives, i.e., the temporal patterns analysis and robustness information capture based on market observations, we suggest an optimal deep learning network in our model that incorporates a stacked sparse denoising autoencoder (SSDAE) and a long-short-term-memory-based autoencoder (LSTM-AE). The findings in times of COVID-19 show that the suggested model using two deep learning models gives better results with an alluring performance profile in comparison with four standard machine learning models and two state-of-the-art reinforcement learning models in terms of Sharpe ratio, Calmar ratio, and beta and alpha values. Furthermore, we analyzed which deep learning models and reward functions were most effective in optimizing the agent's management decisions. The results of our suggested model for investors can assist in reducing the risk of investment loss as well as help them to make sound decisions.
引用
收藏
页数:20
相关论文
共 74 条
[1]  
Agarwal A., 2006, ACM P 23 INT C MACH, P9, DOI DOI 10.1145/1143844.1143846
[2]   An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown [J].
Almahdi, Saud ;
Yang, Steve Y. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 87 :267-279
[3]  
[Anonymous], 2000, Value at Risk: The New Benchmark for Managing Financial Risk
[4]  
[Anonymous], 2009, P 26 ANN INT C MACH, DOI 10.1145/1553374.1553505
[5]   DDSA: A Defense Against Adversarial Attacks Using Deep Denoising Sparse Autoencoder [J].
Bakhti, Yassine ;
Fezza, Sid Ahmed ;
Hamidouche, Wassim ;
Deforges, Olivier .
IEEE ACCESS, 2019, 7 :160397-160407
[6]   A deep learning framework for financial time series using stacked autoencoders and long-short term memory [J].
Bao, Wei ;
Yue, Jun ;
Rao, Yulei .
PLOS ONE, 2017, 12 (07)
[7]   Customer Experience in Fintech [J].
Barbu, Catalin Mihail ;
Florea, Dorian Laurentiu ;
Dabija, Dan-Cristian ;
Barbu, Mihai Constantin Razvan .
JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2021, 16 (05) :1415-1433
[8]  
Benhamou E, 2019, Arxiv, DOI arXiv:1911.10254
[9]  
Betancourt C., 2021, Expert Syst. Appl., V164, DOI DOI 10.1016/J.ESWA.2020.114002
[10]  
Bin L., 2022, EC MANAG FINANC MARK, V17, P9, DOI [10.22381/emfm17120221, DOI 10.22381/EMFM17120221]