Algorithms may not learn to play a unique Nash equilibrium

被引：0

作者：

Fujiwara-Greve, Takako ^{[1
]}

Nielsen, Carsten Krabbe ^{[2
]}

机构：

[1] Keio Univ, Dept Econ, Minato Ku, 2-15-45 Mita, Tokyo 1088345, Japan

[2] Catholic Univ Milan, Dipartimento Econ & Finanza, Via Necchi 5, I-20123 Milan, Italy

来源：

JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE | 2021年 / 4卷 / 02期

关键词：

Algorithm; Learning; Nash equilibrium; Impossibility; EVOLUTION; GAMES;

D O I：

10.1007/s42001-021-00109-9

中图分类号：

O1 [数学]; C [社会科学总论];

学科分类号：

03 ; 0303 ; 0701 ; 070101 ;

摘要：

There is a widespread hope that, in the near future, algorithms become so sophisticated that "solutions" to most problems are found by machines. In this note, we throw some doubts on this expectation by showing the following impossibility result: given a set of finite-memory, finite-iteration algorithms, a continuum of games exist, whose unique and strict Nash equilibrium cannot be reached from a large set of initial states. A Nash equilibrium is a social solution to conflicts of interest, and hence finite algorithms should not be always relied upon for social problems. Our result also shows how to construct games to deceive a given set of algorithms to be trapped in a cycle without a Nash equilibrium.

引用

页码：839 / 850

页数：12

共 20 条

[1] STRATEGY SUBSETS CLOSED UNDER RATIONAL BEHAVIOR [J].

BASU, K ;

WEIBULL, JW .

ECONOMICS LETTERS, 1991, 36 (02) :141-146

[2]

Brown George W., 1951, ACTIVITY ANAL PRODUC, P374

[3]

Foster DP, 2003, GAME ECON BEHAV, V45, P73, DOI 10.1016/S0899-8256(02)00025-3

[4] LEARNING MIXED EQUILIBRIA [J].

FUDENBERG, D ;

KREPS, DM .

GAMES AND ECONOMIC BEHAVIOR, 1993, 5 (03) :320-367

[5] Heterogeneous beliefs and local information in stochastic fictitious play [J].

Fudenberg, Drew ;

Takahashi, Satoru .

GAMES AND ECONOMIC BEHAVIOR, 2011, 71 (01) :100-120

[6]

Gilli M., 2001, B ECON RES, V53, P275, DOI [10.1111/1467-8586.00135, DOI 10.1111/1467-8586.00135]

[7] LEARNING BY FORGETFUL PLAYERS [J].

HURKENS, S .

GAMES AND ECONOMIC BEHAVIOR, 1995, 11 (02) :304-329

[8] LEARNING, MUTATION, AND LONG-RUN EQUILIBRIA IN GAMES [J].

KANDORI, M ;

MAILATH, GJ ;

ROB, R .

ECONOMETRICA, 1993, 61 (01) :29-56

[9] Identifying Higher-Order Rationality [J].

Kneeland, Terri .

ECONOMETRICA, 2015, 83 (05) :2065-2079

[10] Real-time deep reinforcement learning based vehicle navigation [J].

Koh, Songsang ;

Zhou, Bo ;

Fang, Hui ;

Yang, Po ;

Yang, Zaili ;

Yang, Qiang ;

Guan, Lin ;

Ji, Zhigang .

APPLIED SOFT COMPUTING, 2020, 96

← 1 2 →