Dynamic stock-decision ensemble strategy based on deep reinforcement learning

Cited by: 11

Authors:

Yu, Xiaoming [1]

Wu, Wenjun [1]

Liao, Xingchuang [1]

Han, Yong [1]

Affiliation:

[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China

Source:

APPLIED INTELLIGENCE | 2023, Vol. 53, No. 02

Funding:

National Key R&D Program of China;

Keywords:

Investment market; Stock trading; Deep reinforcement learning; Real-time decision-making; PREDICTION;

DOI:

10.1007/s10489-022-03606-0

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In a complex and changeable stock market, it is very important to design a trading agent that can benefit investors. In this paper, we propose two stock trading decision-making methods. First, we propose a nested reinforcement learning (Nested RL) method based on three deep reinforcement learning models (the Advantage Actor Critic, Deep Deterministic Policy Gradient, and Soft Actor Critic models) that adopts an integration strategy by nesting reinforcement learning on the basic decision-maker. Thus, this strategy can dynamically select agents according to the current situation to generate trading decisions made under different market environments. Second, to inherit the advantages of three basic decision-makers, we consider confidence and propose a weight random selection with confidence (WRSC) strategy. In this way, investors can gain more profits by integrating the advantages of all agents. All the algorithms are validated for the U.S., Japanese and British stocks and evaluated by different performance indicators. The experimental results show that the annualized return, cumulative return, and Sharpe ratio values of our ensemble strategy are higher than those of the baselines, which indicates that our nested RL and WRSC methods can assist investors in their portfolio management with more profits under the same level of investment risk.
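The three performance indicators named in the abstract (cumulative return, annualized return, and Sharpe ratio) can be illustrated with a minimal Python sketch. This record does not state the exact formulas the authors used, so the code below follows the conventional definitions. The function names, the 252 trading periods per year, and the default zero risk-free rate are assumptions made for illustration only.

```python
import math


def cumulative_return(values):
    """Total return over the whole series of portfolio values."""
    return values[-1] / values[0] - 1.0


def annualized_return(values, periods_per_year=252):
    """Geometric return scaled to one year.

    periods_per_year=252 is an assumed convention (daily data),
    not a figure taken from the paper.
    """
    n_periods = len(values) - 1
    return (values[-1] / values[0]) ** (periods_per_year / n_periods) - 1.0


def sharpe_ratio(values, risk_free_rate=0.0, periods_per_year=252):
    """Annualized Sharpe ratio from a series of portfolio values.

    risk_free_rate is an annual rate; zero is an assumed default.
    Returns NaN when the per-period returns have no variation.
    """
    returns = [b / a - 1.0 for a, b in zip(values, values[1:])]
    rf_per_period = risk_free_rate / periods_per_year
    excess = [r - rf_per_period for r in returns]
    mean = sum(excess) / len(excess)
    # Sample variance (n - 1 in the denominator).
    var = sum((x - mean) ** 2 for x in excess) / (len(excess) - 1)
    std = math.sqrt(var)
    if std == 0.0:
        return float("nan")
    return mean / std * math.sqrt(periods_per_year)


if __name__ == "__main__":
    # Hypothetical portfolio values, not data from the paper.
    portfolio = [100.0, 101.0, 100.5, 102.0, 103.5]
    print(cumulative_return(portfolio))
    print(annualized_return(portfolio))
    print(sharpe_ratio(portfolio))
```

Because the Sharpe ratio divides mean excess return by its standard deviation, a higher value at a similar volatility corresponds to the abstract's claim of more profit under the same level of investment risk.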


Pages: 2452-2470

Page count: 19

