DEEP FICTITIOUS PLAY FOR STOCHASTIC DIFFERENTIAL GAMES

被引:0
作者
Hu, Ruimeng [1 ,2 ,3 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA
[3] Univ Calif Santa Barbara, Dept Stat & Appl Probabil, Santa Barbara, CA 93106 USA
关键词
Stochastic differential game; fictitious play; deep learning; Nash equilibrium; MEAN-FIELD GAMES; CONVERGENCE; EQUATIONS;
D O I
暂无
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we apply the idea of fictitious play to design deep neural networks (DNNs), and develop deep learning theory and algorithms for computing the Nash equilibrium of asymmetric N-player non-zero-sum stochastic differential games, for which we refer as deep fictitious play, a multi-stage learning process. Specifically at each stage, we propose the strategy of letting individual player optimize her own payoff subject to the other players' previous actions, equivalent to solving N decoupled stochastic control optimization problems, which are approximated by DNNs. Therefore, the fictitious play strategy leads to a structure consisting of N DNNs, which only communicate at the end of each stage. The resulting deep learning algorithm based on fictitious play is scalable, parallel and model-free, i.e., using GPU parallelization, it can be applied to any N-player stochastic differential game with different symmetries and heterogeneities (e.g., existence of major players). We illustrate the performance of the deep learning algorithm by comparing to the closed-form solution of the linear quadratic game. Moreover, we prove the convergence of fictitious play under appropriate assumptions, and verify that the convergent limit forms an open-loop Nash equilibrium. We also discuss the extensions to other strategies designed upon fictitious play and closed-loop Nash equilibrium in the end.
引用
收藏
页码:325 / 353
页数:29
相关论文
共 64 条
[1]  
[Anonymous], 2018, ARXIV181204300
[2]  
[Anonymous], 2016, DEEP REINF LEARN WOR
[3]   Deep Neural Networks Algorithms for Stochastic Control Problems on Finite Horizon: Numerical Applications [J].
Bachouch, Achref ;
Hure, Come ;
Langrene, Nicolas ;
Huyen Pham .
METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2022, 24 (01) :143-178
[4]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[5]   Fictitious play in 2 X n games [J].
Berger, U .
JOURNAL OF ECONOMIC THEORY, 2005, 120 (02) :139-154
[6]   Brown's original fictitious play [J].
Berger, Ulrich .
JOURNAL OF ECONOMIC THEORY, 2007, 135 (01) :572-578
[7]   Stable solutions in potential mean field game systems [J].
Briani, Ariela ;
Cardaliaguet, Pierre .
NODEA-NONLINEAR DIFFERENTIAL EQUATIONS AND APPLICATIONS, 2018, 25 (01)
[8]  
Brown G., 1951, Act. Anal. Prod Allocation, V13, P374
[9]  
Brown George W, 1949, Technical report
[10]   LEARNING IN MEAN FIELD GAMES: THE FICTITIOUS PLAY [J].
Cardaliaguet, Pierre ;
Hadikhanloo, Saeed .
ESAIM-CONTROL OPTIMISATION AND CALCULUS OF VARIATIONS, 2017, 23 (02) :569-591