Improved Reinforcement Learning in Asymmetric Real-time Strategy Games via Strategy Diversity

Cited by: 2
Authors
Dasgupta, Prithviraj [1 ]
Kliem, John [1 ]
Affiliations
[1] US Naval Res Lab, Informat Technol Div, Washington, DC 20375 USA
Keywords
Real-time strategy game; asymmetric game; deep reinforcement learning; strategy diversity; LEVEL; GO;
DOI
10.17083/ijsg.v10i1.548
Chinese Library Classification (CLC) number
G40 [Education];
Discipline classification code
040101 ; 120403 ;
Abstract
We investigate the use of artificial intelligence (AI)-based techniques in learning to play a 2-player, real-time strategy (RTS) game called Hunting-of-the-Plark. The game is challenging for both humans and AI-based techniques because players cannot observe each other's moves during play and because the asymmetric game rules put one player at a disadvantage. We analyze the performance of different deep reinforcement learning algorithms for training software agents to play the game. Existing reinforcement learning techniques for RTS games enable players to converge towards an equilibrium outcome of the game but usually do not facilitate further exploration of techniques to exploit and defeat the opponent. To address this shortcoming, we investigate techniques, including self-play and strategy diversity, that players can use to improve their performance beyond the equilibrium outcome. We observe that when players use self-play, their number of wins begins to cycle around an equilibrium value as each player quickly learns to outwit and defeat its opponent, and vice versa. Finally, we show that strategy diversity could be used as an effective means to mitigate the performance handicap that the asymmetric nature of the game imposes on the disadvantaged player.
Pages: 19-38
Page count: 20