The Hanabi challenge: A new frontier for AI research

Cited by: 114
Authors
Bard, Nolan [1]
Foerster, Jakob N. [2]
Chandar, Sarath [3]
Burch, Neil [1]
Lanctot, Marc [1]
Song, H. Francis [4]
Parisotto, Emilio [5]
Dumoulin, Vincent [3]
Moitra, Subhodeep [3]
Hughes, Edward [4]
Dunning, Iain [4]
Mourad, Shibl [6]
Larochelle, Hugo [3]
Bellemare, Marc G. [3]
Bowling, Michael [1]
Affiliations
[1] DeepMind, Edmonton, AB, Canada
[2] Univ Oxford, Oxford, England
[3] Google Brain, Montreal, PQ, Canada
[4] DeepMind, London, England
[5] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[6] DeepMind, Montreal, PQ, Canada
Keywords
Multi-agent learning; Challenge paper; Reinforcement learning; Games; Theory of mind; Communication; Imperfect information; Cooperative; ARCADE LEARNING-ENVIRONMENT; COMPREHENSIVE SURVEY; REINFORCEMENT; GAME; GO; POKER;
DOI
10.1016/j.artint.2019.103216
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
From the early days of computing, games have been important testbeds for studying how well machines can do sophisticated decision making. In recent years, machine learning has made dramatic advances with artificial agents reaching superhuman performance in challenge domains like Go, Atari, and some variants of poker. As with their predecessors of chess, checkers, and backgammon, these game domains have driven research by providing sophisticated yet well-defined challenges for artificial intelligence practitioners. We continue this tradition by proposing the game of Hanabi as a new challenge domain with novel problems that arise from its combination of purely cooperative gameplay with two to five players and imperfect information. In particular, we argue that Hanabi elevates reasoning about the beliefs and intentions of other agents to the foreground. We believe developing novel techniques for such theory of mind reasoning will not only be crucial for success in Hanabi, but also in broader collaborative efforts, especially those with human partners. To facilitate future research, we introduce the open-source Hanabi Learning Environment, propose an experimental framework for the research community to evaluate algorithmic advances, and assess the performance of current state-of-the-art techniques. © 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Pages: 19
Related papers
50 in total
  • [1] A New Challenge: Approaching Tetris Link with AI
    Muller-Brockhausen, Matthias
    Preuss, Mike
    Plaat, Aske
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 143 - 150
  • [2] Official International Mahjong: A New Playground for AI Research
    Lu, Yunlong
    Li, Wenxin
    Li, Wenlong
    ALGORITHMS, 2023, 16 (05)
  • [3] The Hidden Rules of Hanabi: How Humans Outperform AI Agents
    Sidji, Matthew
    Smith, Wally
    Rogerson, Melissa J.
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023), 2023,
  • [4] Mjx: A framework for Mahjong AI research
    Koyamada, Sotetsu
    Habara, Keigo
    Goto, Nao
    Okano, Shinri
    Nishimori, Soichiro
    Ishii, Shin
    2022 IEEE CONFERENCE ON GAMES, COG, 2022, : 504 - 507
  • [5] Social Media As a New Challenge for Political Participation Research
    Svelch, Jaroslav
    Vochocova, Lenka
    SOCIOLOGICKY CASOPIS-CZECH SOCIOLOGICAL REVIEW, 2015, 51 (01): : 65 - 87
  • [6] Extracellular Vesicles: A New Frontier for Research in Acute Respiratory Distress Syndrome
    Mahida, Rahul Y.
    Matsumoto, Shotaro
    Matthay, Michael A.
    AMERICAN JOURNAL OF RESPIRATORY CELL AND MOLECULAR BIOLOGY, 2020, 63 (01) : 15 - 24
  • [7] Exploring the frontier of anthropomorphism in AI agents: Trends and way forward
    Chaturvedi, Rijul
    Verma, Sanjeev
    Srivastava, Vartika
    Khot, Shailesh Sampat
    BUSINESS AND SOCIETY REVIEW, 2025,
  • [8] Strength Adjustment and Assessment for MCTS-Based Programs [Research Frontier]
    Liu, An-Jen
    Wu, Ti-Rong
    Wu, I-Chen
    Guei, Hung
    Wei, Ting-Han
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2020, 15 (03) : 60 - 73
  • [9] Recent Research on AI in Games
    Xia, Boming
    Ye, Xiaozhen
    Abuassba, Adnan O. M.
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 505 - 510
  • [10] A Game AI Competition to Foster Collaborative AI Research and Development
    Salta, Ana
    Prada, Rui
    Melo, Francisco S.
    IEEE TRANSACTIONS ON GAMES, 2021, 13 (04) : 398 - 409