Algorithms may not learn to play a unique Nash equilibrium

被引:0
作者
Fujiwara-Greve, Takako [1 ]
Nielsen, Carsten Krabbe [2 ]
机构
[1] Keio Univ, Dept Econ, Minato Ku, 2-15-45 Mita, Tokyo 1088345, Japan
[2] Catholic Univ Milan, Dipartimento Econ & Finanza, Via Necchi 5, I-20123 Milan, Italy
来源
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE | 2021年 / 4卷 / 02期
关键词
Algorithm; Learning; Nash equilibrium; Impossibility; EVOLUTION; GAMES;
D O I
10.1007/s42001-021-00109-9
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
There is a widespread hope that, in the near future, algorithms become so sophisticated that "solutions" to most problems are found by machines. In this note, we throw some doubts on this expectation by showing the following impossibility result: given a set of finite-memory, finite-iteration algorithms, a continuum of games exist, whose unique and strict Nash equilibrium cannot be reached from a large set of initial states. A Nash equilibrium is a social solution to conflicts of interest, and hence finite algorithms should not be always relied upon for social problems. Our result also shows how to construct games to deceive a given set of algorithms to be trapped in a cycle without a Nash equilibrium.
引用
收藏
页码:839 / 850
页数:12
相关论文
共 20 条
  • [1] STRATEGY SUBSETS CLOSED UNDER RATIONAL BEHAVIOR
    BASU, K
    WEIBULL, JW
    [J]. ECONOMICS LETTERS, 1991, 36 (02) : 141 - 146
  • [2] Brown GW, 1951, ACTIVITY ANAL PRODUC, P374
  • [3] Foster DP, 2003, GAME ECON BEHAV, V45, P73, DOI 10.1016/S0899-8256(02)00025-3
  • [4] LEARNING MIXED EQUILIBRIA
    FUDENBERG, D
    KREPS, DM
    [J]. GAMES AND ECONOMIC BEHAVIOR, 1993, 5 (03) : 320 - 367
  • [5] Heterogeneous beliefs and local information in stochastic fictitious play
    Fudenberg, Drew
    Takahashi, Satoru
    [J]. GAMES AND ECONOMIC BEHAVIOR, 2011, 71 (01) : 100 - 120
  • [6] Gilli M., 2001, B ECON RES, V53, P275, DOI [10.1111/1467-8586.00135, DOI 10.1111/1467-8586.00135]
  • [7] LEARNING BY FORGETFUL PLAYERS
    HURKENS, S
    [J]. GAMES AND ECONOMIC BEHAVIOR, 1995, 11 (02) : 304 - 329
  • [8] LEARNING, MUTATION, AND LONG-RUN EQUILIBRIA IN GAMES
    KANDORI, M
    MAILATH, GJ
    ROB, R
    [J]. ECONOMETRICA, 1993, 61 (01) : 29 - 56
  • [9] Identifying Higher-Order Rationality
    Kneeland, Terri
    [J]. ECONOMETRICA, 2015, 83 (05) : 2065 - 2079
  • [10] Real-time deep reinforcement learning based vehicle navigation
    Koh, Songsang
    Zhou, Bo
    Fang, Hui
    Yang, Po
    Yang, Zaili
    Yang, Qiang
    Guan, Lin
    Ji, Zhigang
    [J]. APPLIED SOFT COMPUTING, 2020, 96