A self-adaptive predictive policy for pursuit-evasion game

Cited by: 0
Authors
Luo, Zhen [1]
Cao, Qi-Xin [1]
Zhao, Yan-Zheng [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Res Inst Robot, Shanghai 200240, Peoples R China
Keywords
action preference; payoff function; predictive; pursuit-evasion games; self-adaptive
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The proposed self-adaptive predictive pursuit policy consists of two procedures: an action decision-making procedure and a procedure for adjusting the estimate of the evader's action preference. Because a correct estimate of the opponent's intention helps in winning adversarial games, the policy introduces the concept of action preference to model the opponent's decision-making. An evader often has different action preferences in different situations, so to model the evader's decision-making the pursuer divides the situation space into many categories and maintains a set of estimates of the evader's action preference for each category. The pursuer adjusts the estimate for a given situation by observing the evader's actions. The action decision-making procedure consists of situation sorting, computation of possible future states, payoff evaluation, and action selection. Decisions are based on a decision tree constructed from expected payoffs; expected payoffs are integrated from single payoffs, and single payoffs are evaluated from the gains of features that reflect the adversarial situation. A simulation of middle-size soccer robots was carried out and illustrates that the proposed policy is effective.
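The two procedures described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the grid world, the two situation categories ("near" and "far"), the count-based preference update, the single distance feature, and every name in the code are assumptions chosen for illustration, and the sketch looks only one step ahead rather than building the full decision tree the abstract mentions.

```python
from collections import defaultdict

# Hypothetical action set on a 2-D grid (an assumption, not from the paper).
ACTIONS = {"left": (-1, 0), "right": (1, 0), "up": (0, 1),
           "down": (0, -1), "stay": (0, 0)}


def manhattan(a, b):
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def move(pos, action):
    dx, dy = ACTIONS[action]
    return (pos[0] + dx, pos[1] + dy)


def classify_situation(pursuer, evader):
    """Situation sorting: an assumed two-category split by distance."""
    return "near" if manhattan(pursuer, evader) <= 3 else "far"


class PreferenceEstimator:
    """Per-situation estimate of the evader's action preference.

    Assumption: preferences are kept as action counts, starting from one
    pseudo-count per action (a uniform initial estimate).
    """

    def __init__(self):
        self.counts = defaultdict(lambda: {a: 1.0 for a in ACTIONS})

    def probabilities(self, situation):
        counts = self.counts[situation]
        total = sum(counts.values())
        return {a: n / total for a, n in counts.items()}

    def observe(self, situation, evader_action):
        # Adjust the estimate after watching what the evader actually did.
        self.counts[situation][evader_action] += 1.0


def single_payoff(pursuer, evader, next_pursuer, next_evader):
    """Gain of one assumed feature: how much the distance shrinks."""
    return manhattan(pursuer, evader) - manhattan(next_pursuer, next_evader)


def choose_action(pursuer, evader, estimator):
    """Pick the pursuer action with the highest expected payoff."""
    prefs = estimator.probabilities(classify_situation(pursuer, evader))
    expected = {}
    for p_action in ACTIONS:
        next_pursuer = move(pursuer, p_action)
        # Weight each possible future state by the estimated preference.
        expected[p_action] = sum(
            prob * single_payoff(pursuer, evader,
                                 next_pursuer, move(evader, e_action))
            for e_action, prob in prefs.items()
        )
    best = max(expected, key=expected.get)
    return best, expected


estimator = PreferenceEstimator()
for _ in range(20):
    estimator.observe("far", "right")   # evader keeps fleeing to the right
best_action, payoffs = choose_action((0, 0), (5, 0), estimator)
```

In this sketch the estimate for the "far" situation shifts toward "right" after the observations, while the untouched "near" situation keeps its uniform estimate, which mirrors the idea of one preference set per situation category.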
Pages: 1397 - 1407
Number of pages: 11