An Object Oriented Approach to Fuzzy Actor-Critic Learning for Multi-Agent Differential Games

被引:0
作者
Schwartz, Howard [1 ]
机构
[1] Carleton Univ, Dept Syst & Comp Engn, 1125 Colonel By Dr, Ottawa, ON, Canada
来源
2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019) | 2019年
关键词
reinforcement learning; fuzzy systems; differential games; actor critic learning; multi-agent systems; CONTROLLERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new form of the multi-agent fuzzy actor-critic learning algorithm for differential games. An object oriented approach to defining the relationships between agents is proposed. We define the fuzzy inference system as a network structure and define attributes of the agents as rule sets that fired and rewards associated with the fired rule set. The resulting fuzzy actor-critic reinforcement learning algorithm is investigated for playing the differential pursuer super evader game. The game is played in a continuous state and action space to simulate a real world environment. All the robots in the game are simultaneously learning.
引用
收藏
页码:183 / 190
页数:8
相关论文
共 18 条
[1]  
Al Faiya B.M., 2012, 2012 20 MEDITERRANEA, P247
[2]  
Analikwu CV, 2017, INT J INNOV COMPUT I, V13, P1855, DOI 10.24507/ijicic.13.06.1855
[3]  
[Anonymous], 2010, IEEE INT C SYST MAN
[4]  
[Anonymous], 2009, P IEEE SYST MAN CYB
[5]   A Decentralized Fuzzy Learning Algorithm for Pursuit-Evasion Differential Games with Superior Evaders [J].
Awheda, Mostafa D. ;
Schwartz, Howard M. .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 83 (01) :35-53
[6]  
Awheda MD, 2015, CAN CON EL COMP EN, P1006, DOI 10.1109/CCECE.2015.7129412
[7]  
Carlos D., 2008, P 25 INT C MACH LEAR, P240, DOI 10.1145/1390156.1390187
[8]   An approach to tune fuzzy controllers based on reinforcement learning for autonomous vehicle control [J].
Dai, X ;
Li, CK ;
Rad, AB .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2005, 6 (03) :285-293
[9]  
Desouky S., 2009, 2009 IEEE INT C SYST, P2683
[10]   Q(λ)-learning adaptive fuzzy logic controllers for pursuit-evasion differential games [J].
Desouky, Sameh F. ;
Schwartz, Howard M. .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2011, 25 (10) :910-927