A deep Q-learning based algorithmic trading system for commodity futures markets

被引：7

作者：

Massahi, Mahdi ^{[1
]}

Mahootchi, Masoud ^{[1
]}

机构：

[1] Amirkabir Univ Technol, Dept Ind Engn & Management Syst, 424 Hafez Ave, Tehran, Iran

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 237卷

关键词：

Algorithmic trading; Commodity futures market; Deep Q-learning; Double Deep Q-learning; Market simulator; HIGH-FREQUENCY; STRATEGIES; GP;

D O I：

10.1016/j.eswa.2023.121711

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Nowadays, investors seek more sophisticated decision-making tools that maximize their profit from investing in the financial markets by suitably determining the optimal position, trading time, price, and volume. This paper proposes a novel intraday algorithmic trading system for volatile commodity futures markets based on a Deep Q-network (DQN) algorithm and its robust double-version (DDQN). The higher volatility, leverage property, and more liquidity in futures contracts give investors more opportunity to take advantage of speculative behaviors with a relatively small amount of capital; however, the volatility brings more difficulties in the learning phase. As an essential prerequisite to training and evaluating any trading algorithm in the futures market, we develop a simulator to replicate a real futures exchange market environment that executes recommended trading signals by handling the clearing and margin management and the pre-order checking mechanisms. Moreover, this study provides a new definition of the continuous state and action spaces that match the futures market's characteristics. To address the curse of dimensionality, we utilize a multi-agent architecture equipped with the Gated Recurrent Unit (GRU) networks to approximate the Q-values functions. The experimental results demonstrate that implementing the proposed trading algorithms (especially the DDQN) into the actual intraday data of gold coin futures contracts significantly outperforms the benchmarks in terms of return, risk, and risk-adjusted return.

引用

页数：15

共 57 条

[1] Using genetic algorithms to find technical trading rules
Allen, F
Karjalainen, R
[J]. JOURNAL OF FINANCIAL ECONOMICS, 1999, 51 (02) : 245 - 271
[2] Almgren Robert, 2001, J. Risk, V3, P5, DOI DOI 10.21314/JOR.2001.041
[3] [Anonymous], 2010, P ADV NEUR INF PROC
[4] [Anonymous], 2003, International Review of Economics and Finance, DOI DOI 10.1016/S1059-0560(02)00129-6
[5] Becker LA, 2003, PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, P1136
[6] Bellman R., 1961, Adaptive Control Processes, DOI DOI 10.1515/9781400874668
[7] Robust technical trading strategies using GP for algorithmic portfolio selection
Berutich, Jose Manuel
Lopez, Francisco
Luna, Francisco
Quintana, David
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 307 - 315
[8] Financial econometric analysis at ultra-high frequency: Data handling concerns
Brownlees, C. T.
Gallo, G. M.
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (04) : 2232 - 2245
[9] Reinforcement learning applied to Forex trading
Carapuco, Joao
Neves, Rui
Horta, Nuno
[J]. APPLIED SOFT COMPUTING, 2018, 73 : 783 - 794
[10] Multi-DQN: An ensemble of Deep Q-learning agents for stock market forecasting
Carta, Salvatore
Ferreira, Anselmo
Podda, Alessandro Sebastian
Recupero, Diego Reforgiato
Sanna, Antonio
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164

← 1 2 3 4 5 6 →