Promoting human-AI interaction makes a better adoption of deep reinforcement learning: a real-world application in game industry

被引:3
作者
Hu, Zhipeng [1 ,2 ]
Liu, Haoyu [2 ]
Xiong, Yu [2 ]
Wang, Lizi [2 ]
Wu, Runze [2 ]
Guan, Kai [2 ]
Hu, Yujing [2 ]
Lyu, Tangjie [2 ]
Fan, Changjie [2 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou, Peoples R China
[2] NetEase Inc, Fuxi AI Lab, Hangzhou, Peoples R China
关键词
Game AI; Deep reinforcement learning; Explainable reinforcement learning; Human-AI interaction; GO;
D O I
10.1007/s11042-023-15361-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning (DRL) has been widely employed in game industry, mainly for building automatic game agents. While its performance and efficiency has significantly outperformed traditional approaches, the lack of model transparency constrains the interaction between the model and the human operators, thus degrading the practicality of DRL methods. In this paper, we propose to mitigate this human-AI interaction issue in a game industry scenario. Previously, existing methods need repetitive execution of DRL or are designed towards specific tasks, which are not applicable for our deployment scenario. Considering that different games could have different DRL AI agents, we hereby develop a post-hoc explanation framework which regards original DRL as a black-box model and can be applicable to any DRL based agents. Within the framework, a specially selected student model, which has been already well explored for model explanation, is employed to learn the decision policies of the trained DRL model. Then, by giving explanation information for the student model, indirect but practical explanation results can be obtained for original DRL model. Based on this information, the interaction between human and AI agents can be enhanced, benefiting deployment of DRL. Finally, based on the dataset from a real-world production game, we conduct experiments and user studies to illustrate the effectiveness of the proposed procedure from both objective and subjective perspectives.
引用
收藏
页码:6161 / 6182
页数:22
相关论文
共 55 条
  • [1] Guest editorial: Interaction in immersive experiences
    Agius, Harry
    Daylamani-Zad, Damon
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (20) : 30939 - 30942
  • [2] Amir D, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1168
  • [3] Anderson A, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1328
  • [4] [Anonymous], 2012, P SIGCHI C HUM FACT, DOI [DOI 10.1145/2207676.2207680, 10.1145/2207676.2207680]
  • [5] Deep Reinforcement Learning A brief survey
    Arulkumaran, Kai
    Deisenroth, Marc Peter
    Brundage, Miles
    Bharath, Anil Anthony
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) : 26 - 38
  • [6] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
    Barredo Arrieta, Alejandro
    Diaz-Rodriguez, Natalia
    Del Ser, Javier
    Bennetot, Adrien
    Tabik, Siham
    Barbado, Alberto
    Garcia, Salvador
    Gil-Lopez, Sergio
    Molina, Daniel
    Benjamins, Richard
    Chatila, Raja
    Herrera, Francisco
    [J]. INFORMATION FUSION, 2020, 58 : 82 - 115
  • [7] Berner C., 2019, arXiv
  • [8] Explainable Machine Learning in Deployment
    Bhatt, Umang
    Xiang, Alice
    Sharma, Shubham
    Weller, Adrian
    Taly, Ankur
    Jia, Yunhan
    Ghosh, Joydeep
    Puri, Ruchir
    Moura, Jose M. F.
    Eckersley, Peter
    [J]. FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 648 - 657
  • [9] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140
  • [10] Augmented reality technologies, systems and applications
    Carmigniani, Julie
    Furht, Borko
    Anisetti, Marco
    Ceravolo, Paolo
    Damiani, Ernesto
    Ivkovic, Misa
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 51 (01) : 341 - 377