MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading

被引：0

作者：

Cheng, Xi ^{[1
]}

Zhang, Jinghao ^{[1
]}

Zeng, Yunan ^{[1
]}

Xue, Wenfang ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China

来源：

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT IV, PAKDD 2024 | 2024年 / 14648卷

基金：

中国国家自然科学基金;

关键词：

Algorithmic trading; Reinforcement learning; Optimal transport;

D O I：

10.1007/978-981-97-2238-9_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Algorithmic trading refers to executing buy and sell orders for specific assets based on automatically identified trading opportunities. Strategies based on reinforcement learning (RL) have demonstrated remarkable capabilities in addressing algorithmic trading problems. However, the trading patterns differ among market conditions due to shifted distribution data. Ignoring multiple patterns in the data will undermine the performance of RL. In this paper, we propose MOT, which designs multiple actors with disentangled representation learning to model the different patterns of the market. Furthermore, we incorporate the Optimal Transport (OT) algorithm to allocate samples to the appropriate actor by introducing a regularization loss term. Additionally, we propose Pretrain Module to facilitate imitation learning by aligning the outputs of actors with expert strategy and better balance the exploration and exploitation of RL. Experimental results on real futures market data demonstrate that MOT exhibits excellent profit capabilities while balancing risks. Ablation studies validate the effectiveness of the components of MOT.

引用

页码：30 / 42

页数：13

共 50 条

[31] Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes
Zhou, Dongruo
Gu, Quanquan
Szepesvari, Csaba
CONFERENCE ON LEARNING THEORY, VOL 134, 2021, 134
[32] Pairs trading strategy optimization using the reinforcement learning method: a cointegration approach
Saeid Fallahpour
Hasan Hakimian
Khalil Taheri
Ehsan Ramezanifar
Soft Computing, 2016, 20 : 5051 - 5066
[33] Pairs trading strategy optimization using the reinforcement learning method: a cointegration approach
Fallahpour, Saeid
Hakimian, Hasan
Taheri, Khalil
Ramezanifar, Ehsan
SOFT COMPUTING, 2016, 20 (12) : 5051 - 5066
[34] Unleashing the Power of Multi-Agent Reinforcement Learning for Algorithmic Trading in the Digital Financial Frontier and Enterprise Information Systems
Sarin, Saket
Singh, Sunil K.
Kumar, Sudhakar
Goyal, Shivam
Gupta, Brij Bhooshan
Alhalabi, Wadee
Arya, Varsha
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (02): : 3123 - 3138
[35] Time-driven feature-aware jointly deep reinforcement learning for financial signal representation and algorithmic trading
Lei, Kai
Zhang, Bing
Li, Yu
Yang, Min
Shen, Ying
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
[36] Learning Multiple Stock Trading Patterns with Temporal Routing Adaptor and Optimal Transport
Lin, Hengxu
Zhou, Dong
Liu, Weiqing
Bian, Jiang
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1017 - 1026
[37] Adaptive Algorithm for Selecting the Optimal Trading Strategy Based on Reinforcement Learning for Managing a Hedge Fund
Belyakov, B.
Sizykh, D.
IEEE ACCESS, 2024, 12 : 189047 - 189063
[38] Offense and defence against adversarial sample: A reinforcement learning method in energy trading market
Li, Donghe
Yang, Qingyu
Ma, Linyue
Peng, Zhenhua
Liao, Xiao
FRONTIERS IN ENERGY RESEARCH, 2023, 10
[39] Offline Reinforcement Learning Via Optimal Transport And Improved Performance Difference Theorem
Wang, Boyi
Lin, Kai
Sun, Guohan
2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2023, : 619 - 626
[40] DAC: Quantized Optimal Transport Reward-based Reinforcement Learning Approach to Detoxify Query Auto-Completion
Maheswaran, Aishwarya
Maurya, Kaushal Kumar
Gupta, Manish
Desarkar, Maunendra Sankar
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 608 - 618

← 1 2 3 4 5 →