Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model

Cited by: 3
Authors
Wang, Chengjie [1, 2]
Deng, Juan [3]
Zhao, Hui [4]
Li, Li [1, 2]
Affiliations
[1] Tongji Univ, Dept Control Sci & Engn, Coll Elect & Informat Engn, 4800 Caoan Highway, Shanghai 201804, Peoples R China
[2] Shanghai Res Inst Intelligent Autonomous Syst, Shanghai 200120, Peoples R China
[3] Natl Univ Def Technol, Coll Sci, Changsha 410073, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Dept Comp & Sci, 4800 Caoan Highway, Shanghai 201804, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Vicsek model; Evolutionary game theory; Q-learning; Cooperation; Dynamics
DOI
10.1016/j.amc.2024.128956
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Collective behavior has been studied extensively, and communication between agents can strongly affect both the payoff and the cost of making decisions. Research usually focuses on how to improve the collective synchronization rate or accelerate the emergence of cooperation under given communication cost constraints. In this context, evolutionary game theory (EGT) and reinforcement learning (RL) are essential frameworks for tackling this intricate problem. In this study, an adapted Vicsek model is introduced in which agents exhibit different movement patterns depending on their chosen strategies. Each agent gains a payoff determined by the advantages of collective motion weighed against the cost of communicating with neighboring agents. Individuals choose their target agents using a Q-learning strategy and then adapt their own strategies following the Fermi rule. The results show that, with Q-learning applied, cooperation and synchronization reach their highest levels at an optimal communication radius. Similar conclusions are drawn for the influence of random noise and relative cost. Different cost functions were considered to demonstrate the robustness of the proposed model and its conclusions under a wide range of conditions. (https://github.com/WangchengjieT/VM-EGT-Q)
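The abstract names three building blocks: Vicsek-style alignment, Q-learning for choosing a target agent, and the Fermi rule for strategy imitation. The following is a minimal sketch of the textbook form of each, not the authors' implementation. The function names, the parameters `kappa`, `alpha`, and `gamma`, and the Q-table layout are all assumptions made for illustration; the abstract does not specify the paper's state, action, or reward definitions.

```python
import math
import numpy as np


def fermi_adoption_probability(payoff_self, payoff_target, kappa=0.1):
    """Standard Fermi rule: probability of copying the target's strategy.

    kappa (assumed name) is the selection-noise parameter; the exponent is
    clipped only to avoid floating-point overflow.
    """
    exponent = np.clip((payoff_self - payoff_target) / kappa, -500.0, 500.0)
    return 1.0 / (1.0 + math.exp(exponent))


def q_learning_update(q_table, state, action, reward, next_state,
                      alpha=0.1, gamma=0.9):
    """One tabular Q-learning step, applied in place and returned.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    td_target = reward + gamma * np.max(q_table[next_state])
    q_table[state, action] += alpha * (td_target - q_table[state, action])
    return q_table


def vicsek_heading(own_heading, neighbor_headings, noise, rng):
    """Classic Vicsek alignment: mean direction of self and neighbors,
    perturbed by uniform noise drawn from [-noise/2, noise/2]."""
    angles = np.append(neighbor_headings, own_heading)
    mean_angle = math.atan2(np.sin(angles).mean(), np.cos(angles).mean())
    return mean_angle + rng.uniform(-noise / 2.0, noise / 2.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q = np.zeros((2, 2))  # hypothetical 2-state, 2-action table
    q_learning_update(q, state=0, action=1, reward=1.0, next_state=1)
    print(q)
    print(fermi_adoption_probability(1.0, 2.0))
    print(vicsek_heading(0.0, np.array([0.1, -0.1]), noise=0.2, rng=rng))
```

In the adapted model, an agent's strategy would presumably change which neighbors enter `neighbor_headings` and what communication cost is subtracted from its payoff; those details are in the paper and repository, not in this sketch.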
Pages: 11