A Dynamics Perspective of Pursuit-Evasion Games of Intelligent Agents with the Ability to Learn

被引:0
作者
Xiong, Hao [1 ]
Cao, Huanhui [1 ]
Lu, Wenjie [1 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Mech Engn & Automat, Shenzhen 518055, Peoples R China
来源
2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年
关键词
dynamics; reinforcement learning; pursuit-evasion games;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pursuit-evasion games are ubiquitous in nature and in an artificial world. In nature, pursuer(s) and evader( s) are intelligent agents that can learn from experience, and dynamics (i.e., Newtonian or Lagrangian) are vital for the pursuer and the evader in some scenarios. To this end, this paper addresses the pursuit-evasion game of intelligent agents from the perspective of dynamics. A bio-inspired dynamics formulation of a pursuit-evasion game and baseline pursuit and evasion strategies are introduced at first. Then, reinforcement learning techniques are used to mimic the ability of intelligent agents to learn from experience. Based on the dynamics formulation and reinforcement learning techniques, the effects of improving both pursuit and evasion strategies based on experience in pursuit-evasion games are investigated at two levels : I) individual runs and 2) ranges of the parameters of pursuit-evasion games. The results of the investigation are consistent with nature observations and the natural law - survival of the fittest. More importantly, with respect to the result of a pursuit-evasion game of agents with baseline strategies, this study achieves a different result. It is shown that, in a pursuit-evasion game with a dynamics formulation, an evader is not able to escape from a slightly faster pursuer with an effective learned pursuit strategy, based on agile maneuvers and an effective learned evasion strategy.
引用
收藏
页码:7082 / 7087
页数:6
相关论文
共 50 条
  • [1] Pursuit-evasion games in the presence of obstacles
    Oyler, Dave W.
    Kabamba, Pierre T.
    Girard, Anouck R.
    AUTOMATICA, 2016, 65 : 1 - 11
  • [2] Pursuit-Evasion Games of High Speed Evader
    Ramana, M. V.
    Kothari, Mangal
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (02) : 293 - 306
  • [3] Pursuit-Evasion Games of High Speed Evader
    M. V. Ramana
    Mangal Kothari
    Journal of Intelligent & Robotic Systems, 2017, 85 : 293 - 306
  • [4] Team formation through an assessor: choosing MARL agents in pursuit-evasion games
    Zhao, Yue
    Ju, Lushan
    Hernandez-Orallo, Jose
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (03) : 3473 - 3492
  • [5] Comparing the power of cops to zombies in pursuit-evasion games
    Offner, David
    Ojakian, Kerry
    DISCRETE APPLIED MATHEMATICS, 2019, 271 : 144 - 151
  • [6] Pursuit-evasion games of multiple cooperative pursuers and an evader: A biological-inspired perspective
    Wang, Jianan
    Li, Guilu
    Liang, Li
    Wang, Chunyan
    Deng, Fang
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2022, 110
  • [7] On applied nonlinear and bilevel programming for pursuit-evasion games
    Ehtamo, H
    Raivio, T
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2001, 108 (01) : 65 - 96
  • [8] Pursuit-Evasion Games with incomplete information in discrete time
    Gurel-Gurevich, Ori
    INTERNATIONAL JOURNAL OF GAME THEORY, 2009, 38 (03) : 367 - 376
  • [9] A Stochastic Characterization of the Capture Zone in Pursuit-Evasion Games
    Battistini, Simone
    GAMES, 2020, 11 (04): : 1 - 10
  • [10] Pursuit-evasion games: a tractable framework for antijamming games in aerial attacks
    Parras, Juan
    Zazo, Santiago
    del Val, Jorge
    Zazo, Javier
    Macua, Sergio Valcarcel
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2017,