Hedging of financial derivative contracts via Monte Carlo tree search

被引：0

作者：

Szehr, Oleg ^{[1
]}

机构：

[1] SUPSI USI, Dalle Molle Inst Artificial Intelligence IDSIA, Via La Santa 1, CH-6962 Lugano, Switzerland

来源：

JOURNAL OF COMPUTATIONAL FINANCE | 2023年 / 27卷 / 02期

关键词：

reinforcement learning; Monte Carlo tree search (MCTS); pricing and hedging of derivative contracts; AlphaZero; utility optimization; CONTINGENT CLAIMS; ALGORITHM; OPTIONS; GAME; GO;

D O I：

10.21314/JCF.2023.009

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

The construction of replication strategies for the pricing and hedging of derivative contracts in incomplete markets is a key problem in financial engineering. We interpret this problem as a "game with the world", where one player (the investor) bets on what will happen and the other player (the market) decides what will happen. Inspired by the success of the Monte Carlo tree search (MCTS) in a variety of games and stochastic multiperiod planning problems, we introduce this algorithm as a method for replication in the presence of risk and market friction. Unlike modelfree reinforcement learning methods (such as Q-learning), MCTS makes explicit use of an environment model. The role of this model is taken by a market simulator, which is frequently adopted even in the training of model-free methods, but its use allows MCTS to plan for the consequences of decisions prior to the execution of actions. We conduct experiments with the AlphaZero variant of MCTS on toy examples of simple market models and derivatives with simple payoff structures. We show that MCTS is capable of maximizing the utility of the investor's terminal wealth in a setting where no external pricing information is available and rewards are granted only as a result of contractual cashflows. In this setting, we observe that MCTS hassuperior performance compared with the deep Q-network algorithm and comparable performance to "deep-hedging" methods.

引用

页码：47 / 80

页数：34

共 60 条

[1] CONTINGENT CLAIMS VALUATION WHEN THE SECURITY PRICE IS A COMBINATION OF AN ITO PROCESS AND A RANDOM POINT PROCESS [J].