The Hidden Rules of Hanabi: How Humans Outperform AI Agents

被引:1
作者
Sidji, Matthew [1 ]
Smith, Wally [1 ]
Rogerson, Melissa J. [1 ]
机构
[1] Univ Melbourne, Melbourne, Vic, Australia
来源
PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023) | 2023年
关键词
Human-AI Interaction; boardgames; Human-Computer Interaction; teaming; Human-AI Teaming; social roles; theory of mind; MENTAL MODELS; PERFORMANCE; SUPPORT; DESIGN;
D O I
10.1145/3544548.3581550
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Games that feature multiple players, limited communication, and partial information are particularly challenging for AI agents. In the cooperative card game Hanabi, which possesses all of these attributes, AI agents fail to achieve scores comparable to even first-time human players. Through an observational study of three mixed-skill Hanabi play groups, we identify the techniques used by humans that help to explain their superior performance compared to AI. These concern physical artefact manipulation, coordination play, role establishment, and continual rule negotiation. Our findings extend previous accounts of human performance in Hanabi, which are purely in terms of theory-of-mind reasoning, by revealing more precisely how this form of collective decision-making is enacted in skilled human play. Our interpretation points to a gap in the current capabilities of AI agents to perform cooperative tasks.
引用
收藏
页数:16
相关论文
共 77 条
[1]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[2]   Mental Models of Mere Mortals with Explanations of Reinforcement Learning [J].
Anderson, Andrew ;
Dodge, Jonathan ;
Sadarangani, Amrita ;
Juozapaitis, Zoe ;
Newman, Evan ;
Irvine, Jed ;
Chattopadhyay, Souti ;
Olson, Matthew ;
Fern, Alan ;
Burnett, Margaret .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2020, 10 (02)
[3]  
[Anonymous], 1996, COGNITION COMMUNICAT
[4]  
[Anonymous], 2006, Human-Machine Reconfigurations: Plans and Situated Actions
[5]  
Bansal Gagan, 2019, P AAAI C HUM COMP CR, P2, DOI DOI 10.1609/HCOMP.V7I1.5285
[6]  
Baptista S, 2020, DIABETES TECHNOL THE, V22, pA221
[7]   The Hanabi challenge: A new frontier for AI research [J].
Bard, Nolan ;
Foerster, Jakob N. ;
Chandar, Sarath ;
Burch, Neil ;
Lanctot, Marc ;
Song, H. Francis ;
Parisotto, Emilio ;
Dumoulin, Vincent ;
Moitra, Subhodeep ;
Hughes, Edward ;
Dunning, Iain ;
Mourad, Shibl ;
Larochelle, Hugo ;
Bellemare, Marc G. ;
Bowling, Michael .
ARTIFICIAL INTELLIGENCE, 2020, 280
[8]   Reinforcement Learning for the Adaptive Scheduling of Educational Activities [J].
Bassen, Jonathan ;
Balaji, Bharathan ;
Schaarschmidt, Michael ;
Thille, Candace ;
Painter, Jay ;
Zimmaro, Dawn ;
Gamest, Alex ;
Fast, Ethan ;
Mitchell, John C. .
PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
[9]   Systematic Review and Inventory of Theory of Mind Measures for Young Children [J].
Beaudoin, Cindy ;
Leblanc, Elizabel ;
Gagner, Charlotte ;
Beauchamp, Miriam H. .
FRONTIERS IN PSYCHOLOGY, 2020, 10
[10]  
Bendell Rhyse, 2021, Advances in Neuroergonomics and Cognitive Engineering. Proceedings of the AHFE 2021 Virtual Conferences on Neuroergonomics and Cognitive Engineering, Industrial Cognitive Ergonomics and Engineering Psychology, and Cognitive Computing and Internet of Things. Lecture Notes in Networks and Systems (LNNS 259), P20, DOI 10.1007/978-3-030-80285-1_3