Dynamic stock-decision ensemble strategy based on deep reinforcement learning

被引:11
作者
Yu, Xiaoming [1 ]
Wu, Wenjun [1 ]
Liao, Xingchuang [1 ]
Han, Yong [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
基金
国家重点研发计划;
关键词
Investment market; Stock trading; Deep reinforcement learning; Real-time decision-making; PREDICTION;
D O I
10.1007/s10489-022-03606-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a complex and changeable stock market, it is very important to design a trading agent that can benefit investors. In this paper, we propose two stock trading decision-making methods. First, we propose a nested reinforcement learning (Nested RL) method based on three deep reinforcement learning models (the Advantage Actor Critic, Deep Deterministic Policy Gradient, and Soft Actor Critic models) that adopts an integration strategy by nesting reinforcement learning on the basic decision-maker. Thus, this strategy can dynamically select agents according to the current situation to generate trading decisions made under different market environments. Second, to inherit the advantages of three basic decision-makers, we consider confidence and propose a weight random selection with confidence (WRSC) strategy. In this way, investors can gain more profits by integrating the advantages of all agents. All the algorithms are validated for the U.S., Japanese and British stocks and evaluated by different performance indicators. The experimental results show that the annualized return, cumulative return, and Sharpe ratio values of our ensemble strategy are higher than those of the baselines, which indicates that our nested RL and WRSC methods can assist investors in their portfolio management with more profits under the same level of investment risk.
引用
收藏
页码:2452 / 2470
页数:19
相关论文
共 45 条
  • [1] [Anonymous], IEEE Transactions on Intelligent Transportation Systems
  • [2] [Anonymous], OpenAI Baselines: ACKTR A2C
  • [3] Multiobjective Evolution of Fuzzy Rough Neural Network via Distributed Parallelism for Stock Prediction
    Cao, Bin
    Zhao, Jianwei
    Lv, Zhihan
    Gu, Yu
    Yang, Peng
    Halgamuge, Saman K.
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 939 - 952
  • [4] Multi-DQN: An ensemble of Deep Q-learning agents for stock market forecasting
    Carta, Salvatore
    Ferreira, Anselmo
    Podda, Alessandro Sebastian
    Recupero, Diego Reforgiato
    Sanna, Antonio
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164
  • [5] A multi-layer and multi-ensemble stock trader using deep learning and deep reinforcement learning
    Carta, Salvatore
    Corriga, Andrea
    Ferreira, Anselmo
    Podda, Alessandro Sebastian
    Recupero, Diego Reforgiato
    [J]. APPLIED INTELLIGENCE, 2021, 51 (02) : 889 - 905
  • [6] Forward Forecast of Stock Price Using Sliding-Window Metaheuristic-Optimized Machine-Learning Regression
    Chou, Jui-Sheng
    Thi-Kha Nguyen
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3132 - 3142
  • [7] Coleman B, 2021, HUMAN VERSUS MACHINE
  • [8] Daldaban, 2020, ARTIFICIALLY INTELLI
  • [9] Deng Y, 2021, OPTIMIZATION BLOCKCH, V385
  • [10] Deep Direct Reinforcement Learning for Financial Signal Representation and Trading
    Deng, Yue
    Bao, Feng
    Kong, Youyong
    Ren, Zhiquan
    Dai, Qionghai
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 653 - 664