Supervised actor-critic reinforcement learning with action feedback for algorithmic trading

Cited by: 5
Authors
Sun, Qizhou [1 ]
Si, Yain-Whar [1 ]
Affiliations
[1] Univ Macau, Dept Comp & Informat Sci, Ave da Univ, Taipa, Macau, Peoples R China
Keywords
Finance; Reinforcement learning; Supervised learning; Algorithmic trading; Energy
DOI
10.1007/s10489-022-04322-5
CLC number
TP18 [Artificial intelligence theory]
Discipline codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning is one of the most promising approaches for algorithmic trading in financial markets. However, in certain situations, buy or sell orders issued by an algorithmic trading program may not be filled entirely. Taking these real-world market scenarios into account, this paper proposes a novel framework named Supervised Actor-Critic Reinforcement Learning with Action Feedback (SACRL-AF) to address this problem. The action feedback mechanism of SACRL-AF notifies the actor of the dealt (actually executed) positions and corrects the corresponding transitions in the replay buffer. Meanwhile, the dealt positions serve as labels for supervised learning. Recent studies have shown that Deep Deterministic Policy Gradient (DDPG) and Twin Delayed Deep Deterministic Policy Gradient (TD3) are more stable than, and superior to, other actor-critic algorithms. Against this background, and based on the proposed SACRL-AF framework, two reinforcement learning algorithms, henceforth referred to as Supervised Deep Deterministic Policy Gradient with Action Feedback (SDDPG-AF) and Supervised Twin Delayed Deep Deterministic Policy Gradient with Action Feedback (STD3-AF), are proposed. Experimental results show that SDDPG-AF and STD3-AF achieve state-of-the-art profitability.
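The action feedback mechanism described above lends itself to a short sketch. The PyTorch snippet below is a minimal, illustrative reading of the abstract only, not the authors' implementation: the Actor network, the execute_order stand-in for the exchange, the fill_ratio value, and the way the supervised term is combined with the actor loss are all assumptions made for illustration.

```python
# Illustrative sketch of the action-feedback idea (hypothetical names;
# not the authors' released code).
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Maps a market state to a target position in [-1, 1]."""
    def __init__(self, state_dim: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, 64), nn.ReLU(),
            nn.Linear(64, 1), nn.Tanh(),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state)

def execute_order(proposed: torch.Tensor, fill_ratio: float) -> torch.Tensor:
    """Stand-in for the exchange: only part of the order is filled.
    Returns the dealt position actually obtained in the market."""
    return proposed * fill_ratio

state_dim = 8
actor = Actor(state_dim)
state = torch.randn(1, state_dim)

proposed = actor(state)                        # action issued by the actor
dealt = execute_order(proposed.detach(), 0.6)  # partial fill reported back

# Feedback step 1: store the *dealt* action in the replay buffer, so the
# critic learns from the transition that actually occurred in the market.
replay_buffer = []
reward, next_state, done = 0.01, torch.randn(1, state_dim), False
replay_buffer.append((state, dealt, reward, next_state, done))

# Feedback step 2: use the dealt position as a supervised label for the
# actor, pulling its output toward executable actions.
supervised_loss = nn.functional.mse_loss(proposed, dealt)
supervised_loss.backward()  # combined with the DDPG/TD3 actor loss in practice
```

In a full SDDPG-AF or STD3-AF training loop, this supervised term would be combined with the usual DDPG or TD3 actor objective; the weighting between the two is a design choice the abstract does not specify.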
Pages: 16875-16892
Page count: 18
Related papers (50 in total)
  • [21] Actor-Critic Reinforcement Learning for Automatic Left Atrial Appendage Segmentation
    Abdullah, Al Walid
    Yun, Il Dong
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 609 - 612
  • [22] On the sample complexity of actor-critic method for reinforcement learning with function approximation
    Kumar, Harshat
    Koppel, Alec
    Ribeiro, Alejandro
    MACHINE LEARNING, 2023, 112 (07) : 2433 - 2467
  • [23] An Actor-Critic Reinforcement Learning Approach for Energy Harvesting Communications Systems
    Masadeh, Ala'eddin
    Wang, Zhengdao
    Kamal, Ahmed E.
    2019 28TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2019,
  • [24] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
    Pantoja-Garcia, Luis
    Parra-Vega, Vicente
    Garcia-Rodriguez, Rodolfo
    Vazquez-Garcia, Carlos Ernesto
    ROBOTICS, 2023, 12 (05)
  • [25] A multi-agent reinforcement learning using Actor-Critic methods
    Li, Chun-Gui
    Wang, Meng
    Yuan, Qing-Neng
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 878 - 882
  • [27] Using Actor-Critic Reinforcement Learning for Control and Flight Formation of Quadrotors
    Torres, Edgar
    Xu, Lei
    Sardarmehni, Tohid
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [28] Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning
    Shi, Daming
    Guo, Xudong
    Liu, Yi
    Fan, Wenhui
    ENTROPY, 2022, 24 (06)
  • [29] A confidence metric for using neurobiological feedback in actor-critic reinforcement learning based brain-machine interfaces
    Prins, Noeline W.
    Sanchez, Justin C.
    Prasad, Abhishek
    FRONTIERS IN NEUROSCIENCE, 2014, 8
  • [30] Adaptive Assist-as-needed Control Based on Actor-Critic Reinforcement Learning
    Zhang, Yufeng
    Li, Shuai
    Nolan, Karen J.
    Zanotto, Damiano
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4066 - 4071