DEEP FICTITIOUS PLAY FOR STOCHASTIC DIFFERENTIAL GAMES

被引：0

作者：

Hu, Ruimeng ^{[1
,2
,3
]}

机构：

[1] Columbia Univ, Dept Stat, New York, NY 10027 USA

[2] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA

[3] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA

来源：

COMMUNICATIONS IN MATHEMATICAL SCIENCES | 2021年 / 19卷 / 02期

关键词：

Stochastic differential game; fictitious play; deep learning; Nash equilibrium; MEAN-FIELD GAMES; CONVERGENCE; EQUATIONS;

D O I：

暂无

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

In this paper, we apply the idea of fictitious play to design deep neural networks (DNNs), and develop deep learning theory and algorithms for computing the Nash equilibrium of asymmetric N-player non-zero-sum stochastic differential games, for which we refer as deep fictitious play, a multi-stage learning process. Specifically at each stage, we propose the strategy of letting individual player optimize her own payoff subject to the other players' previous actions, equivalent to solving N decoupled stochastic control optimization problems, which are approximated by DNNs. Therefore, the fictitious play strategy leads to a structure consisting of N DNNs, which only communicate at the end of each stage. The resulting deep learning algorithm based on fictitious play is scalable, parallel and model-free, i.e., using GPU parallelization, it can be applied to any N-player stochastic differential game with different symmetries and heterogeneities (e.g., existence of major players). We illustrate the performance of the deep learning algorithm by comparing to the closed-form solution of the linear quadratic game. Moreover, we prove the convergence of fictitious play under appropriate assumptions, and verify that the convergent limit forms an open-loop Nash equilibrium. We also discuss the extensions to other strategies designed upon fictitious play and closed-loop Nash equilibrium in the end.

引用

页码：325 / 353

页数：29

共 64 条

[1]

[Anonymous], 2018, ARXIV181204300

[2]

[Anonymous], 2016, DEEP REINF LEARN WOR

[3] Deep Neural Networks Algorithms for Stochastic Control Problems on Finite Horizon: Numerical Applications [J].

Bachouch, Achref ;

Hure, Come ;

Langrene, Nicolas ;

Huyen Pham .

METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2022, 24 (01) :143-178

[4] Learning Deep Architectures for AI [J].

Bengio, Yoshua .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127

[5] Fictitious play in 2 X n games [J].

Berger, U .

JOURNAL OF ECONOMIC THEORY, 2005, 120 (02) :139-154

[6] Brown's original fictitious play [J].

Berger, Ulrich .

JOURNAL OF ECONOMIC THEORY, 2007, 135 (01) :572-578

[7] Stable solutions in potential mean field game systems [J].

Briani, Ariela ;

Cardaliaguet, Pierre .

NODEA-NONLINEAR DIFFERENTIAL EQUATIONS AND APPLICATIONS, 2018, 25 (01)

[8]

Brown G., 1951, Act. Anal. Prod Allocation, V13, P374

[9]

Brown George W, 1949, Technical report

[10] LEARNING IN MEAN FIELD GAMES: THE FICTITIOUS PLAY [J].

Cardaliaguet, Pierre ;

Hadikhanloo, Saeed .

ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2017, 23 (02) :569-591

← 1 2 3 4 5 6 7 →