A deep Q-learning based algorithmic trading system for commodity futures markets

被引:7
作者
Massahi, Mahdi [1 ]
Mahootchi, Masoud [1 ]
机构
[1] Amirkabir Univ Technol, Dept Ind Engn & Management Syst, 424 Hafez Ave, Tehran, Iran
关键词
Algorithmic trading; Commodity futures market; Deep Q-learning; Double Deep Q-learning; Market simulator; HIGH-FREQUENCY; STRATEGIES; GP;
D O I
10.1016/j.eswa.2023.121711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, investors seek more sophisticated decision-making tools that maximize their profit from investing in the financial markets by suitably determining the optimal position, trading time, price, and volume. This paper proposes a novel intraday algorithmic trading system for volatile commodity futures markets based on a Deep Q-network (DQN) algorithm and its robust double-version (DDQN). The higher volatility, leverage property, and more liquidity in futures contracts give investors more opportunity to take advantage of speculative behaviors with a relatively small amount of capital; however, the volatility brings more difficulties in the learning phase. As an essential prerequisite to training and evaluating any trading algorithm in the futures market, we develop a simulator to replicate a real futures exchange market environment that executes recommended trading signals by handling the clearing and margin management and the pre-order checking mechanisms. Moreover, this study provides a new definition of the continuous state and action spaces that match the futures market's characteristics. To address the curse of dimensionality, we utilize a multi-agent architecture equipped with the Gated Recurrent Unit (GRU) networks to approximate the Q-values functions. The experimental results demonstrate that implementing the proposed trading algorithms (especially the DDQN) into the actual intraday data of gold coin futures contracts significantly outperforms the benchmarks in terms of return, risk, and risk-adjusted return.
引用
收藏
页数:15
相关论文
共 57 条
  • [1] Using genetic algorithms to find technical trading rules
    Allen, F
    Karjalainen, R
    [J]. JOURNAL OF FINANCIAL ECONOMICS, 1999, 51 (02) : 245 - 271
  • [2] Almgren Robert, 2001, J. Risk, V3, P5, DOI DOI 10.21314/JOR.2001.041
  • [3] [Anonymous], 2010, P ADV NEUR INF PROC
  • [4] [Anonymous], 2003, International Review of Economics and Finance, DOI DOI 10.1016/S1059-0560(02)00129-6
  • [5] Becker LA, 2003, PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, P1136
  • [6] Bellman R., 1961, Adaptive Control Processes, DOI DOI 10.1515/9781400874668
  • [7] Robust technical trading strategies using GP for algorithmic portfolio selection
    Berutich, Jose Manuel
    Lopez, Francisco
    Luna, Francisco
    Quintana, David
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 307 - 315
  • [8] Financial econometric analysis at ultra-high frequency: Data handling concerns
    Brownlees, C. T.
    Gallo, G. M.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 51 (04) : 2232 - 2245
  • [9] Reinforcement learning applied to Forex trading
    Carapuco, Joao
    Neves, Rui
    Horta, Nuno
    [J]. APPLIED SOFT COMPUTING, 2018, 73 : 783 - 794
  • [10] Multi-DQN: An ensemble of Deep Q-learning agents for stock market forecasting
    Carta, Salvatore
    Ferreira, Anselmo
    Podda, Alessandro Sebastian
    Recupero, Diego Reforgiato
    Sanna, Antonio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164