Supervised actor-critic reinforcement learning with action feedback for algorithmic trading

Cited by: 5
Authors
Sun, Qizhou [1 ]
Si, Yain-Whar [1 ]
Affiliations
[1] Univ Macau, Dept Comp & Informat Sci, Ave da Univ, Taipa, Macau, Peoples R China
Keywords
Finance; Reinforcement learning; Supervised learning; Algorithmic trading; ENERGY;
DOI
10.1007/s10489-022-04322-5
CLC Number
TP18 [Theory of Artificial Intelligence];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Reinforcement learning is one of the most promising approaches for algorithmic trading in financial markets. However, in certain situations, buy or sell orders issued by an algorithmic trading program may not be filled entirely. To account for such real-world scenarios, in this paper we propose a novel framework named Supervised Actor-Critic Reinforcement Learning with Action Feedback (SACRL-AF) to address this problem. The action feedback mechanism of SACRL-AF notifies the actor of the dealt positions and corrects the corresponding transitions in the replay buffer. Meanwhile, the dealt positions are used as labels for supervised learning. Recent studies have shown that Deep Deterministic Policy Gradient (DDPG) and Twin Delayed Deep Deterministic Policy Gradient (TD3) are more stable than and superior to other actor-critic algorithms. Against this background, based on the proposed SACRL-AF framework, we propose two reinforcement learning algorithms, henceforth referred to as Supervised Deep Deterministic Policy Gradient with Action Feedback (SDDPG-AF) and Supervised Twin Delayed Deep Deterministic Policy Gradient with Action Feedback (STD3-AF). Experimental results show that SDDPG-AF and STD3-AF achieve state-of-the-art performance in profitability.
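The following Python sketch illustrates the action feedback idea summarized in the abstract: the dealt (actually executed) position overwrites the intended action in the stored replay-buffer transition and doubles as the label for the actor's supervised loss term. It is a minimal illustration under assumed names (ReplayBuffer, supervised_actor_loss, the weight parameter), not the authors' implementation.

# Minimal sketch of the action-feedback mechanism described in the abstract.
# The environment reports the dealt (executed) position, which replaces the
# intended action in the stored transition and also serves as the supervised
# label for the actor. All names below are illustrative assumptions.

from collections import deque
import random


class ReplayBuffer:
    """Fixed-size buffer whose transitions store the dealt action, not the intended one."""

    def __init__(self, capacity=100_000):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, intended_action, dealt_action, reward, next_state, done):
        # Action feedback: the intended action is shown only for contrast and is
        # discarded; the dealt position is what the critic learns from.
        self.buffer.append((state, dealt_action, reward, next_state, done))

    def sample(self, batch_size):
        return random.sample(self.buffer, batch_size)


def supervised_actor_loss(predicted_action, dealt_action, q_value, weight=0.5):
    """Hypothetical combined objective: a DDPG-style policy-gradient term plus a
    supervised term that pulls the actor toward the dealt position (the label)."""
    rl_term = -q_value                                 # maximize the critic's Q estimate
    sl_term = (predicted_action - dealt_action) ** 2   # imitate the executed position
    return rl_term + weight * sl_term


if __name__ == "__main__":
    buffer = ReplayBuffer()
    # The agent wanted to buy 1.0 unit, but only 0.6 was filled by the market.
    buffer.push(state=[0.1, 0.2], intended_action=1.0, dealt_action=0.6,
                reward=0.03, next_state=[0.12, 0.21], done=False)
    print(supervised_actor_loss(predicted_action=0.9, dealt_action=0.6, q_value=0.8))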
Pages: 16875-16892
Number of Pages: 18
Related Papers
50 records in total
  • [1] Supervised actor-critic reinforcement learning with action feedback for algorithmic trading
    Qizhou Sun
    Yain-Whar Si
    Applied Intelligence, 2023, 53 : 16875 - 16892
  • [2] A World Model for Actor-Critic in Reinforcement Learning
    Panov, A. I.
    Ugadiarov, L. A.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 467 - 477
  • [3] A fuzzy Actor-Critic reinforcement learning network
    Wang, Xue-Song
    Cheng, Yu-Hu
    Yi, Jian-Qiang
    INFORMATION SCIENCES, 2007, 177 (18) : 3764 - 3781
  • [4] Research on actor-critic reinforcement learning in RoboCup
    Guo, He
    Liu, Tianying
    Wang, Yuxin
    Chen, Feng
    Fan, Jianming
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 205 - 205
  • [5] Multi-actor mechanism for actor-critic reinforcement learning
    Li, Lin
    Li, Yuze
    Wei, Wei
    Zhang, Yujia
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 647
  • [6] Heterogeneous trading strategies with adaptive fuzzy Actor-Critic reinforcement learning: A behavioral approach
    Bekiros, Stelios D.
    JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2010, 34 (06): : 1153 - 1170
  • [7] Actor-Critic Reinforcement Learning for Control With Stability Guarantee
    Han, Minghao
    Zhang, Lixian
    Wang, Jun
    Pan, Wei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6217 - 6224
  • [8] MARS: Malleable Actor-Critic Reinforcement Learning Scheduler
    Baheri, Betis
    Tronge, Jacob
    Fang, Bo
    Li, Ang
    Chaudhary, Vipin
    Guan, Qiang
    2022 IEEE INTERNATIONAL PERFORMANCE, COMPUTING, AND COMMUNICATIONS CONFERENCE, IPCCC, 2022,
  • [9] Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control With Action Constraints
    Kasaura, Kazumi
    Miura, Shuwa
    Kozuno, Tadashi
    Yonetani, Ryo
    Hoshino, Kenta
    Hosoe, Yohei
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4449 - 4456
  • [10] Forward Actor-Critic for Nonlinear Function Approximation in Reinforcement Learning
    Veeriah, Vivek
    van Seijen, Harm
    Sutton, Richard S.
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 556 - 564