Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles

被引:14
作者
Sun ChangHao [1 ]
Duan HaiBin [1 ,2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Bioinspired Autonomous Flight Syst BAFS Res Grp, Sci & Technol Aircraft Control Lab, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
unmanned aerial vehicles (UAVs); iterated prisoner's dilemma (IPD); Markov decision evolutionary game (MDEG); replicator dynamics; cooperation; OPTIMIZATION; STRATEGIES; ALLOCATION; ALGORITHM; NETWORKS;
D O I
10.1007/s11431-015-5848-6
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
As one of the major contributions of biology to competitive decision making, evolutionary game theory provides a useful tool for studying the evolution of cooperation. To achieve the optimal solution for unmanned aerial vehicles (UAVs) that are carrying out a sensing task, this paper presents a Markov decision evolutionary game (MDEG) based learning algorithm. Each individual in the algorithm follows a Markov decision strategy to maximize its payoff against the well known Tit-for-Tat strategy. Simulation results demonstrate that the MDEG theory based approach effectively improves the collective payoff of the team. The proposed algorithm can not only obtain the best action sequence but also a sub-optimal Markov policy that is independent of the game duration. Furthermore, the paper also studies the emergence of cooperation in the evolution of self-regarded UAVs. The results show that it is the adaptive ability of the MDEG based approach as well as the perfect balance between revenge and forgiveness of the Tit-for-Tat strategy that the emergence of cooperation should be attributed to.
引用
收藏
页码:1392 / 1400
页数:9
相关论文
共 42 条
[1]   Optimality and complexity of pure Nash equilibria in the coverage game [J].
Ai, Xin ;
Srinivasan, Vikram ;
Tham, Chen-Khong .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2008, 26 (07) :1170-1182
[2]  
Altman E, 2008, IEEE INFOCOM SER, P2432
[3]   Markov Decision Evolutionary Games [J].
Altman, Eitan ;
Hayel, Yezekael .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (07) :1560-1569
[4]   Chemical evolutionary games [J].
Aristotelous, Andreas C. ;
Durrett, Richard .
THEORETICAL POPULATION BIOLOGY, 2014, 93 :1-13
[5]   A computational evolutionary approach to evolving game strategy and cooperation [J].
Azuaje, F .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2003, 33 (03) :498-502
[6]   Short Versus Long Term Benefits and the Evolution of Cooperation in the Prisoner's Dilemma Game [J].
Brede, Markus .
PLOS ONE, 2013, 8 (02)
[7]   Influence of different initial distributions on robust cooperation in scale-free networks: A comparative study [J].
Chen, Xiaojie ;
Fu, Feng ;
Wang, Long .
PHYSICS LETTERS A, 2008, 372 (08) :1161-1167
[8]   A model for parameter estimation of multistage centrifugal compressor and compressor performance analysis using genetic algorithm [J].
Chu Fei ;
Wang FuLi ;
Wang XiaoGang ;
Zhang ShuNing .
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2012, 55 (11) :3163-3175
[9]   Effects of expectation and noise on evolutionary games [J].
Du, Wen-Bo ;
Cao, Xian-Bin ;
Hu, Mao-Bin ;
Yang, Han-Xin ;
Zhou, Hong .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2009, 388 (11) :2215-2220
[10]   Swarm intelligence inspired shills and the evolution of cooperation [J].
Duan, Haibin ;
Sun, Changhao .
SCIENTIFIC REPORTS, 2014, 4