Promoting human-AI interaction makes a better adoption of deep reinforcement learning: a real-world application in game industry

被引：3

作者：

Hu, Zhipeng ^{[1
,2
]}

Liu, Haoyu ^{[2
]}

Xiong, Yu ^{[2
]}

Wang, Lizi ^{[2
]}

Wu, Runze ^{[2
]}

Guan, Kai ^{[2
]}

Hu, Yujing ^{[2
]}

Lyu, Tangjie ^{[2
]}

Fan, Changjie ^{[2
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China

[2] NetEase Inc, Fuxi AI Lab, Hangzhou, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 2期

关键词：

Game AI; Deep reinforcement learning; Explainable reinforcement learning; Human-AI interaction; GO;

D O I：

10.1007/s11042-023-15361-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep reinforcement learning (DRL) has been widely employed in game industry, mainly for building automatic game agents. While its performance and efficiency has significantly outperformed traditional approaches, the lack of model transparency constrains the interaction between the model and the human operators, thus degrading the practicality of DRL methods. In this paper, we propose to mitigate this human-AI interaction issue in a game industry scenario. Previously, existing methods need repetitive execution of DRL or are designed towards specific tasks, which are not applicable for our deployment scenario. Considering that different games could have different DRL AI agents, we hereby develop a post-hoc explanation framework which regards original DRL as a black-box model and can be applicable to any DRL based agents. Within the framework, a specially selected student model, which has been already well explored for model explanation, is employed to learn the decision policies of the trained DRL model. Then, by giving explanation information for the student model, indirect but practical explanation results can be obtained for original DRL model. Based on this information, the interaction between human and AI agents can be enhanced, benefiting deployment of DRL. Finally, based on the dataset from a real-world production game, we conduct experiments and user studies to illustrate the effectiveness of the proposed procedure from both objective and subjective perspectives.

引用

页码：6161 / 6182

页数：22

共 55 条

[1] Guest editorial: Interaction in immersive experiences
Agius, Harry
Daylamani-Zad, Damon
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30939 - 30942
[2] Amir D, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1168
[3] Anderson A, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1328
[4] [Anonymous], 2012, P SIGCHI C HUM FACT, DOI [DOI 10.1145/2207676.2207680, 10.1145/2207676.2207680]
[5] Deep Reinforcement Learning A brief survey
Arulkumaran, Kai
Deisenroth, Marc Peter
Brundage, Miles
Bharath, Anil Anthony
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
[6] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
Barredo Arrieta, Alejandro
Diaz-Rodriguez, Natalia
Del Ser, Javier
Bennetot, Adrien
Tabik, Siham
Barbado, Alberto
Garcia, Salvador
Gil-Lopez, Sergio
Molina, Daniel
Benjamins, Richard
Chatila, Raja
Herrera, Francisco
[J]. INFORMATION FUSION, 2020, 58 : 82 - 115
[7] Berner C., 2019, arXiv
[8] Explainable Machine Learning in Deployment
Bhatt, Umang
Xiang, Alice
Sharma, Shubham
Weller, Adrian
Taly, Ankur
Jia, Yunhan
Ghosh, Joydeep
Puri, Ruchir
Moura, Jose M. F.
Eckersley, Peter
[J]. FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 648 - 657
[9] Bagging predictors
Breiman, L
[J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
[10] Augmented reality technologies, systems and applications
Carmigniani, Julie
Furht, Borko
Anisetti, Marco
Ceravolo, Paolo
Damiani, Ernesto
Ivkovic, Misa
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (01) : 341 - 377

← 1 2 3 4 5 6 →