A Decentralized Fuzzy Learning Algorithm for Pursuit-Evasion Differential Games with Superior Evaders

Cited by: 45
Authors
Awheda, Mostafa D. [1]
Schwartz, Howard M. [1]
Affiliations
[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel Dr, Ottawa, ON K1S 5B6, Canada
Keywords
Fuzzy control; Reinforcement learning; Pursuit-evasion differential games; Apollonius circles
DOI
10.1007/s10846-015-0315-y
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we consider multi-pursuer single-superior-evader pursuit-evasion differential games in which the evader's speed is similar to or higher than that of each pursuer. We propose a new fuzzy reinforcement learning algorithm that uses the well-known Apollonius circle mechanism to define the capture region of the learning pursuer from its location and the location of the superior evader. The algorithm combines the Apollonius circle with a formation control approach in the tuning mechanism of the learning pursuer's fuzzy logic controller (FLC), so that one or more of the learning pursuers can capture the superior evader. The formation control mechanism guarantees that the pursuers are distributed around the superior evader, which avoids collisions between pursuers. It also makes the Apollonius circles of every two adjacent pursuers intersect or at least be tangent to each other, so that capture of the superior evader can occur. The algorithm is decentralized: no communication among the pursuers is required, and the only information it needs is the position and speed of the superior evader. The algorithm is used to learn several multi-pursuer single-superior-evader pursuit-evasion differential games, and the simulation results show its effectiveness.
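The Apollonius circle named in the abstract can be illustrated with a minimal sketch. It uses only the standard geometric definition (the set of points that pursuer and evader reach at the same time) and is not taken from the paper; the function names `apollonius_circle` and `circles_touch`, the parameter `speed_ratio`, and the restriction to a strictly faster evader are assumptions made for illustration.

```python
import math


def apollonius_circle(pursuer, evader, speed_ratio):
    """Return (center, radius) of the Apollonius circle for one pursuer.

    The circle is the set of points X with |X - P| / |X - E| = k, where
    k = speed_ratio = pursuer speed / evader speed.
    Assumption: 0 < k < 1 (strictly faster evader). For k == 1 the locus
    degenerates to a straight line and is not handled here.
    """
    if not 0.0 < speed_ratio < 1.0:
        raise ValueError("speed_ratio must lie strictly between 0 and 1")
    px, py = pursuer
    ex, ey = evader
    k2 = speed_ratio ** 2
    denom = 1.0 - k2
    # Center lies on the line through E and P, on the far side of P from E.
    center = ((px - k2 * ex) / denom, (py - k2 * ey) / denom)
    radius = speed_ratio * math.hypot(px - ex, py - ey) / denom
    return center, radius


def circles_touch(c1, r1, c2, r2, tol=1e-9):
    """True if two circles intersect or are tangent (hypothetical helper)."""
    d = math.hypot(c1[0] - c2[0], c1[1] - c2[1])
    return abs(r1 - r2) - tol <= d <= r1 + r2 + tol
```

Under these assumptions, a check such as `circles_touch(*apollonius_circle(p1, e, k), *apollonius_circle(p2, e, k))` expresses the abstract's condition that the circles of two adjacent pursuers intersect or are tangent. How the paper feeds this condition into the FLC tuning is not stated in this record.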
Pages: 35-53
Number of pages: 19