A simple learning agent interacting with an agent-based market model

被引：6

作者：

Dicks, Matthew ^{[1
]}

Paskaramoorthy, Andrew ^{[1
]}

Gebbie, Tim ^{[1
]}

机构：

[1] Univ Cape Town, Dept Stat Sci, ZA-7700 Cape Town, South Africa

来源：

PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS | 2024年 / 633卷

关键词：

Strategic order-splitting; Reinforcement learning; Market simulation; Agent-based model; PRICE-IMPACT;

D O I：

10.1016/j.physa.2023.129363

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

We consider the learning dynamics of a single reinforcement learning optimal execution trading agent when it interacts with an event-driven agent-based financial market model. Trading takes place asynchronously through a matching engine in event time. The optimal execution agent is considered at different levels of initial order sizes and differently sized state spaces. The resulting impact on the agent-based model and market is considered using a calibration approach that explores changes in the empirical stylised facts and price impact curves. Convergence, volume trajectory and action trace plots are used to visualise the learning dynamics. The smaller state space agents had the number of states they visited converge much faster than the larger state space agents, and they were able to start learning to trade intuitively using the spread and volume states. We find that the moments of the model are robust to the impact of the learning agents, except for the Hurst exponent, which was lowered by the introduction of strategic order-splitting. The introduction of the learning agent preserves the shape of the price impact curves but can reduce the trade-sign auto-correlations and increase the micro-price volatility when the trading volumes increase.

引用

页数：18

共 57 条

[11]

Cartea A., 2015, Algorithmic and High-Frequency Trading

[12]

Cont R, 2001, QUANT FINANC, V1, P223, DOI [10.1088/1469-7688/1/2/304, 10.1080/713665670]

[13]

Cornish Hellaby Watkins ChristopherJohn., 1989, LEARNING DELAYED REW

[14] From agent-based modeling to actor-based reactive systems in the analysis of financial networks [J].

Crafa, Silvia .

JOURNAL OF ECONOMIC INTERACTION AND COORDINATION, 2021, 16 (03) :649-673

[15] DISTRIBUTION OF THE ESTIMATORS FOR AUTOREGRESSIVE TIME-SERIES WITH A UNIT ROOT [J].

DICKEY, DA ;

FULLER, WA .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :427-431

[16]

Dicks M, 2022, A simple learning agent interacting with an agent -based market model: Julia code, DOI [10.25375/uct.21163723.v1,figshare, DOI 10.25375/UCT.21163723.V1,FIGSHARE]

[17]

Dicks M, 2023, Arxiv, DOI arXiv:2303.07393

[18]

Dieci R, 2018, HANDB COMPUT ECON, V4, P257, DOI 10.1016/bs.hescom.2018.03.002

[19] On the problem of calibrating an agent based model for financial markets [J].

Fabretti, Annalisa .

JOURNAL OF ECONOMIC INTERACTION AND COORDINATION, 2013, 8 (02) :277-293

[20]

Farmer JD, 2005, P NATL ACAD SCI USA, V102, P2254, DOI 10.1073/pnas.0409157102

← 1 2 3 4 5 6 →