MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading

Cited by: 0
Authors
Cheng, Xi [1]
Zhang, Jinghao [1]
Zeng, Yunan [1]
Xue, Wenfang [1]
Affiliation
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
Source
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT IV, PAKDD 2024 | 2024 / Vol. 14648
Funding
National Natural Science Foundation of China;
Keywords
Algorithmic trading; Reinforcement learning; Optimal transport;
DOI
10.1007/978-981-97-2238-9_3
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Algorithmic trading refers to executing buy and sell orders for specific assets based on automatically identified trading opportunities. Strategies based on reinforcement learning (RL) have demonstrated remarkable capabilities in addressing algorithmic trading problems. However, trading patterns differ across market conditions because the data distribution shifts, and ignoring these multiple patterns undermines the performance of RL. In this paper, we propose MOT, which uses multiple actors with disentangled representation learning to model the different patterns of the market. Furthermore, we incorporate the Optimal Transport (OT) algorithm to allocate samples to the appropriate actor by introducing a regularization loss term. Additionally, we propose a Pretrain Module that facilitates imitation learning by aligning the outputs of the actors with an expert strategy, which better balances exploration and exploitation in RL. Experimental results on real futures market data demonstrate that MOT achieves strong profitability while balancing risk. Ablation studies validate the effectiveness of the components of MOT.
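The sample-to-actor allocation mentioned in the abstract can be illustrated with a generic entropy-regularized OT (Sinkhorn) routine. This is only a sketch, not the authors' implementation: the abstract does not specify the cost definition, the marginals, or the form of the regularization loss term. The function name `sinkhorn_assign`, the random cost matrix, the uniform marginals, and the values of `epsilon` and `n_iters` are all assumptions made for illustration.

```python
import numpy as np

def sinkhorn_assign(cost, epsilon=0.1, n_iters=200):
    """Entropy-regularized OT plan between N samples and K actors.

    Assumption: both marginals are uniform, i.e. every sample carries
    equal mass and every actor receives an equal share of the samples.
    """
    n, k = cost.shape
    a = np.full(n, 1.0 / n)            # mass per sample
    b = np.full(k, 1.0 / k)            # mass per actor
    kernel = np.exp(-cost / epsilon)   # Gibbs kernel of the cost matrix
    u = np.ones(n)
    v = np.ones(k)
    for _ in range(n_iters):
        u = a / (kernel @ v)           # rescale rows toward marginal a
        v = b / (kernel.T @ u)         # rescale columns toward marginal b
    return u[:, None] * kernel * v[None, :]

# Hypothetical costs: cost[i, j] stands in for the loss of sample i under actor j.
rng = np.random.default_rng(0)
cost = rng.random((8, 2))

plan = sinkhorn_assign(cost)
assignment = plan.argmax(axis=1)       # hard actor index per sample
```

In this sketch the balanced column marginal keeps any single actor from absorbing every sample, which is one common reason for using OT rather than a per-sample argmin over costs. How MOT turns the allocation into a loss term is not described in the abstract and is not shown here.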
Pages: 30-42
Page count: 13