The Hanabi challenge: A new frontier for AI research

被引:114
|
作者
Bard, Nolan [1 ]
Foerster, Jakob N. [2 ]
Chandar, Sarath [3 ]
Burch, Neil [1 ]
Lanctot, Marc [1 ]
Song, H. Francis [4 ]
Parisotto, Emilio [5 ]
Dumoulin, Vincent [3 ]
Moitra, Subhodeep [3 ]
Hughes, Edward [4 ]
Dunning, Iain [4 ]
Mourad, Shibl [6 ]
Larochelle, Hugo [3 ]
Bellemare, Marc G. [3 ]
Bowling, Michael [1 ]
机构
[1] DeepMind, Edmonton, AB, Canada
[2] Univ Oxford, Oxford, England
[3] Google Brain, Montreal, PQ, Canada
[4] DeepMind, London, England
[5] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[6] DeepMind, Montreal, PQ, Canada
关键词
Multi-agent learning; Challenge paper; Reinforcement learning; Games; Theory of mind; Communication; Imperfect information; Cooperative; ARCADE LEARNING-ENVIRONMENT; COMPREHENSIVE SURVEY; REINFORCEMENT; GAME; GO; POKER;
D O I
10.1016/j.artint.2019.103216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From the early days of computing, games have been important testbeds for studying how well machines can do sophisticated decision making. In recent years, machine learning has made dramatic advances with artificial agents reaching superhuman performance in challenge domains like Go, Atari, and some variants of poker. As with their predecessors of chess, checkers, and backgammon, these game domains have driven research by providing sophisticated yet well-defined challenges for artificial intelligence practitioners. We continue this tradition by proposing the game of Hanabi as a new challenge domain with novel problems that arise from its combination of purely cooperative gameplay with two to five players and imperfect information. In particular, we argue that Hanabi elevates reasoning about the beliefs and intentions of other agents to the foreground. We believe developing novel techniques for such theory of mind reasoning will not only be crucial for success in Hanabi, but also in broader collaborative efforts, especially those with human partners. To facilitate future research, we introduce the open-source Hanabi Learning Environment, propose an experimental framework for the research community to evaluate algorithmic advances, and assess the performance of current state-of-the-art techniques. (C) 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:19
相关论文
共 50 条
  • [41] EvoCraft: A New Challenge for Op en-Endedness
    Grbic, Djordje
    Palm, Rasmus Berg
    Najarro, Elias
    Glanois, Claire
    Risi, Sebastian
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2021, 2021, 12694 : 325 - 340
  • [42] A bibliometric study of the research area of videogames using Dimensions.ai database
    Garcia-Sanchez, Pablo
    Mora, Antonio M.
    Castillo, Pedro A.
    Perez, Ignacio J.
    7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 : 737 - 744
  • [43] The new information frontier: toward a more nuanced view of social movement communication
    Earl, Jennifer
    Garrett, R. Kelly
    SOCIAL MOVEMENT STUDIES, 2017, 16 (04) : 479 - 493
  • [44] Comprehensive Care for Mechanical Circulatory Support A New Frontier for Synergy With Palliative Care
    Goldstein, Nathan E.
    May, Christopher W.
    Meier, Diane E.
    CIRCULATION-HEART FAILURE, 2011, 4 (04) : 519 - +
  • [45] Rogue-Gym: A New Challenge for Generalization in Reinforcement Learning
    Kanagawa, Yuji
    Kaneko, Tomoyuki
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [46] The new features of interpretation centers. A challenge for communication professionals
    Botto, Marcelo
    QUESTION, 2024, 3 (77):
  • [47] The Animal-AI Environment: A virtual laboratory for comparative cognition and artificial intelligence research
    Voudouris, Konstantinos
    Slater, Ben
    Cheke, Lucy G.
    Schellaert, Wout
    Hernandez-Orallo, Jose
    Halina, Marta
    Patel, Matishalin
    Alhas, Ibrahim
    Mecattaf, Matteo G.
    Burden, John
    Holmes, Joel
    Chaubey, Niharika
    Donnelly, Niall
    Crosby, Matthew
    BEHAVIOR RESEARCH METHODS, 2025, 57 (04)
  • [48] VR-Dialogue - The AI-powered e-Participation Research Expedition
    Porwol, Lukasz
    Dumas, Catherine Leigh
    PROCEEDINGS OF THE 22ND ANNUAL INTERNATIONAL CONFERENCE ON DIGITAL GOVERNMENT RESEARCH, DGO 2021, 2021, : 587 - 589
  • [49] ADVERTISING RESEARCH IN THE POST-WTO DECADE IN CHINA Meeting the Internationalization Challenge
    Hung, Kineta
    Tse, Caleb H.
    Cheng, Shirley Y. Y.
    JOURNAL OF ADVERTISING, 2012, 41 (03) : 121 - 145
  • [50] Just what are we doing when we're describing AI? Harvey Sacks, the commentator machine, and the descriptive politics of the new artificial intelligence
    Mair, Michael
    Brooker, Phillip
    Dutton, William
    Sormani, Philippe
    QUALITATIVE RESEARCH, 2021, 21 (03) : 341 - 359