Algorithmic trading using combinational rule vector and deep reinforcement learning

被引:6
作者
Huang, Zhen [1 ]
Li, Ning [1 ]
Mei, Wenliang [2 ]
Gong, Wenyong [1 ]
机构
[1] Jinan Univ, Dept Math, Guangzhou, Peoples R China
[2] CHN Energy Investment Grp Co LTD, Beijing, Peoples R China
关键词
Algorithmic trading; Combinational rule vectors; Deep reinforcement learning;
D O I
10.1016/j.asoc.2023.110802
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Algorithmic trading rules are widely used in financial markets as technical analysis tools for security trading. However, traditional trading rules are not sufficient to make a trading decision. In this paper, we propose a new algorithmic trading method called CR-DQN, which incorporates deep Q-learning with two popular trading rules: moving average (MA) and trading range break-out (TRB). The input of deep Q-learning is combinational rule vectors, whose component is a linear combination of 140 rules produced by MA and TRB with different parameters. Due to non-stationary characteristics, we devise a reward driven combination weight updating scheme to generate combinational rule vectors, which can capture intrinsic features of financial data. Since the sparse reward exists in CR-DQN, we design a piecewise reward function which shows great potential in the experiments. Taking combinational rule vectors as input, the LSTM based Deep Q-learning network is used to learn an optimal algorithmic trading strategy. We apply our model to both Chinese and non-Chinese stock markets, and CR-DQN exhibits the best performance on a variety of evaluation criteria compared to many other approaches, demonstrating the effectiveness of the proposed method.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 42 条
[1]   What is the value of the cross-sectional approach to deep reinforcement learning? [J].
Aboussalah, Amine Mohamed ;
Xu, Ziyun ;
Lee, Chi-Guhn .
QUANTITATIVE FINANCE, 2022, 22 (06) :1091-1111
[2]   Using genetic algorithms to find technical trading rules [J].
Allen, F ;
Karjalainen, R .
JOURNAL OF FINANCIAL ECONOMICS, 1999, 51 (02) :245-271
[3]  
[Anonymous], 1962, "Applied Dynamic Programming
[4]   High-Frequency Trading Strategy Based on Deep Neural Networks [J].
Arevalo, Andres ;
Nino, Jaime ;
Hernandez, German ;
Sandoval, Javier .
INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 :424-436
[5]   A deep learning framework for financial time series using stacked autoencoders and long-short term memory [J].
Bao, Wei ;
Yue, Jun ;
Rao, Yulei .
PLOS ONE, 2017, 12 (07)
[6]   GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[7]  
Box GEP., 1970, Time Series Analysis: Forecasting and Control, V65, P1509
[8]   Stock trading system based on the multi-objective particle swarm optimization of technical indicators on end-of-day market data [J].
Briza, Antonio C. ;
Naval, Prospero C., Jr. .
APPLIED SOFT COMPUTING, 2011, 11 (01) :1191-1201
[9]   SIMPLE TECHNICAL TRADING RULES AND THE STOCHASTIC PROPERTIES OF STOCK RETURNS [J].
BROCK, W ;
LAKONISHOK, J ;
LEBARON, B .
JOURNAL OF FINANCE, 1992, 47 (05) :1731-1764
[10]  
Chan E., 2013, Algorithmic trading: winning strategies and their rationale, V625