The Hanabi challenge: A new frontier for AI research

Cited by: 114
Authors
Bard, Nolan [1]
Foerster, Jakob N. [2]
Chandar, Sarath [3]
Burch, Neil [1]
Lanctot, Marc [1]
Song, H. Francis [4]
Parisotto, Emilio [5]
Dumoulin, Vincent [3]
Moitra, Subhodeep [3]
Hughes, Edward [4]
Dunning, Iain [4]
Mourad, Shibl [6]
Larochelle, Hugo [3]
Bellemare, Marc G. [3]
Bowling, Michael [1]
Affiliations
[1] DeepMind, Edmonton, AB, Canada
[2] Univ Oxford, Oxford, England
[3] Google Brain, Montreal, PQ, Canada
[4] DeepMind, London, England
[5] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[6] DeepMind, Montreal, PQ, Canada
Keywords
Multi-agent learning; Challenge paper; Reinforcement learning; Games; Theory of mind; Communication; Imperfect information; Cooperative; ARCADE LEARNING-ENVIRONMENT; COMPREHENSIVE SURVEY; REINFORCEMENT; GAME; GO; POKER;
DOI
10.1016/j.artint.2019.103216
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
From the early days of computing, games have been important testbeds for studying how well machines can do sophisticated decision making. In recent years, machine learning has made dramatic advances with artificial agents reaching superhuman performance in challenge domains like Go, Atari, and some variants of poker. As with their predecessors of chess, checkers, and backgammon, these game domains have driven research by providing sophisticated yet well-defined challenges for artificial intelligence practitioners. We continue this tradition by proposing the game of Hanabi as a new challenge domain with novel problems that arise from its combination of purely cooperative gameplay with two to five players and imperfect information. In particular, we argue that Hanabi elevates reasoning about the beliefs and intentions of other agents to the foreground. We believe developing novel techniques for such theory of mind reasoning will not only be crucial for success in Hanabi, but also in broader collaborative efforts, especially those with human partners. To facilitate future research, we introduce the open-source Hanabi Learning Environment, propose an experimental framework for the research community to evaluate algorithmic advances, and assess the performance of current state-of-the-art techniques. © 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Pages: 19
Related papers
50 in total
  • [31] Patient Power Revolution in Multiple Sclerosis: Navigating the New Frontier
    Yeandle, David
    Rieckmann, Peter
    Giovannoni, Gavin
    Alexandri, Nektaria
    Langdon, Dawn
    NEUROLOGY AND THERAPY, 2018, 7 (02) : 179 - 187
  • [32] Communication law as a new challenge for information society
    Buday-santha, Andrea
    INTERSECTIONS-EAST EUROPEAN JOURNAL OF SOCIETY AND POLITICS, 2024, 10 (03) : 236 - 252
  • [33] The New Social Media: Chance or Challenge for Dialogue?
    Kochler, H.
    POLIS-POLITICHESKIYE ISSLEDOVANIYA, 2013, (04) : 75 - +
  • [34] Trust and trustworthy artificial intelligence: A research agenda for AI in the environmental sciences
    Bostrom, Ann
    Demuth, Julie L.
    Wirz, Christopher D.
    Cains, Mariana G.
    Schumacher, Andrea
    Madlambayan, Deianna
    Bansal, Akansha Singh
    Bearth, Angela
    Chase, Randy
    Crosman, Katherine M.
    Ebert-Uphoff, Imme
    Gagne, David John
    Guikema, Seth
    Hoffman, Robert
    Johnson, Branden B.
    Kumler-Bonfanti, Christina
    Lee, John D.
    Lowe, Anna
    McGovern, Amy
    Przybylo, Vanessa
    Radford, Jacob T.
    Roth, Emilie
    Sutter, Carly
    Tissot, Philippe
    Roebber, Paul
    Stewart, Jebb Q.
    White, Miranda
    Williams, John K.
    RISK ANALYSIS, 2024, 44 (06) : 1498 - 1513
  • [35] Shared Decision-Making: A New Frontier for Case Management Leadership
    Treiger, Teresa M.
    PROFESSIONAL CASE MANAGEMENT, 2020, 25 (02) : 56 - 76
  • [36] COMMUNICATION COMPETENCIES IN THE DIGITAL AGE: THE NEW FRONTIER OF COMMUNICATION ACROSS THE CURRICULUM
    Dominguez, Andrea M.
    EDULEARN16: 8TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2016, : 8455 - 8464
  • [37] Front-end AI vs. Back-end AI: new framework for securing truth in communication during the generative AI era
    Kim, Donggyu
    Kong, Jungwon
    FRONTIERS IN COMMUNICATION, 2023, 8
  • [38] Communicating with the New Generations. The Challenge for Pediatric Dentists
    Saadia, Marc
    Valencia, Roberto
    JOURNAL OF CLINICAL PEDIATRIC DENTISTRY, 2015, 39 (04) : 297 - 302
  • [39] Early patient contact in primary care: a new challenge
    Haffling, AC
    Håkansson, A
    Hagander, B
    MEDICAL EDUCATION, 2001, 35 (09) : 901 - 908
  • [40] Mutuality in AI-enabled new public service solutions
    Koskimies, E.
    Kinder, T.
    PUBLIC MANAGEMENT REVIEW, 2024, 26 (01) : 219 - 244