Smart Robotic Strategies and Advice for Stock Trading Using Deep Transformer Reinforcement Learning

被引:4
作者
Malibari, Nadeem [1 ]
Katib, Iyad [1 ]
Mehmood, Rashid [2 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia
[2] King Abdulaziz Univ, High Performance Comp Ctr, Jeddah 21589, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 24期
关键词
stock trading; transformer; deep reinforcement learning; machine learning; Tadawul; stocks; robotic advice; robotic strategies; TIME-SERIES; PERFORMANCE;
D O I
10.3390/app122412526
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The many success stories of reinforcement learning (RL) and deep learning (DL) techniques have raised interest in their use for detecting patterns and generating constant profits from financial markets. In this paper, we combine deep reinforcement learning (DRL) with a transformer network to develop a decision transformer architecture for online trading. We use data from the Saudi Stock Exchange (Tadawul), one of the largest liquid stock exchanges globally. Specifically, we use the indices of four firms: Saudi Telecom Company, Al-Rajihi Banking and Investment, Saudi Electricity Company, and Saudi Basic Industries Corporation. To ensure the robustness and risk management of the proposed model, we consider seven reward functions: the Sortino ratio, cumulative returns, annual volatility, omega, the Calmar ratio, max drawdown, and normal reward without any risk adjustments. Our proposed DRL-based model provided the highest average increase in the net worth of Saudi Telecom Company, Saudi Electricity Company, Saudi Basic Industries Corporation, and Al-Rajihi Banking and Investment at 21.54%, 18.54%, 17%, and 19.36%, respectively. The Sortino ratio, cumulative returns, and annual volatility were found to be the best-performing reward functions. This work makes significant contributions to trading regarding long-term investment and profit goals.
引用
收藏
页数:33
相关论文
共 66 条
[41]  
Padial D.L., TECHNICAL ANAL LIB U
[42]   Reinforcement Learning in Stock Trading [J].
Quang-Vinh Dang .
ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING (ICCSAMA 2019), 2020, 1121 :311-322
[43]  
Rahm E., 2001, IEEE DATA ENG B TECH, V24, P1
[44]   Transforming variables to central normality [J].
Raymaekers, Jakob ;
Rousseeuw, Peter J. .
MACHINE LEARNING, 2024, 113 (08) :4953-4975
[45]  
Sadighian J., 2020, ARXIV
[46]  
saudiexchange, PORTAL TADAWUL KNOWL
[47]  
Selvin S, 2017, 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P1643, DOI 10.1109/ICACCI.2017.8126078
[48]   THE SHARPE RATIO [J].
SHARPE, WF .
JOURNAL OF PORTFOLIO MANAGEMENT, 1994, 21 (01) :49-58
[49]   Missing value imputation on missing completely at random data using multilayer perceptrons [J].
Silva-Ramirez, Esther-Lydia ;
Pino-Mejias, Rafael ;
Lopez-Coello, Manuel ;
Cubiles-de-la-Vega, Maria-Dolores .
NEURAL NETWORKS, 2011, 24 (01) :121-129
[50]   The use of multiple imputation for the analysis of missing data [J].
Sinharay, S ;
Stern, HS ;
Russell, D .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :317-329