Research on multi-agent strategy based on filtering mechanism to filter information

被引:0
作者
Chen L. [1 ]
Guo T. [1 ]
Liu Y.-T. [1 ]
Yang J.-M. [1 ]
机构
[1] School of Automation and Electrical Engineering, Shenyang Ligong University, Shenyang
来源
Kongzhi yu Juece/Control and Decision | 2022年 / 37卷 / 06期
关键词
Centralized training decentralized execution; Filtering mechanism; Multi-agent system; Reinforcement learning;
D O I
10.13195/j.kzyjc.2020.1139
中图分类号
学科分类号
摘要
When multi-agent systems cooperate or compete, the joint information space will be enlarged and the efficiency of information extraction between agents will be reduced. In this paper, a multi-agent reinforcement learning strategy (FMAC) with filtering mechanism to filter information is adopted to enhance the ability of information communication between agents. By finding the related agents and calculating their information contribution according to the correlation, the method filters out the irrelevant agent information so as to realize the effective communication between agents in cooperative competition or mixed environment. At the same time, the centralized training decentralized execution method is adopted to solve the non-stationarity of environment. In this paper, experiments are carried out by comparing algorithms to verify that the improved algorithm improves the strategy iteration efficiency and generalization ability, and can maintain stable effects when the number of agents increases, which is conducive to the application of multi-agent reinforcement learning to a wider range of fields. Copyright ©2022 Control and Decision.
引用
收藏
页码:1643 / 1648
页数:5
相关论文
共 27 条
[1]  
Gao Y, Chen S F, Lu X., Research on reinforcement learning technology: A review, Acta Automatica Sinica, 30, 1, pp. 86-100, (2004)
[2]  
Arulkumaran K, Deisenroth M P, Brundage M, Et al., Deep reinforcement learning: A brief survey, IEEE Signal Processing Magazine, 34, 6, pp. 26-38, (2017)
[3]  
Zhao Z H, Gao Y, Luo B, Et al., Reinforcement learning technology in multi-agent system, Computer Science, 31, 3, pp. 23-27, (2004)
[4]  
Anderson B D O, Yu C B, Fidan B, Et al., Rigid graph control architectures for autonomous formations, IEEE Control Systems Magazine, 28, 6, pp. 48-63, (2008)
[5]  
Hernandez-Leal P, Kaisers M, Baarslag T, Et al., A survey of learning in multiagent environments: Dealing with non-stationarity, (2017)
[6]  
Matignon L, Laurent G J, Le Fort-Piat N., Independent reinforcement learners in cooperative Markov games: A survey regarding coordination problems, The Knowledge Engineering Review, 27, 1, pp. 1-31, (2012)
[7]  
Zhang J, Pan Y Z, Yang H T, Et al., Multi-agent decision making using Monte Carlo Q-value function, Control and Decision, 35, 3, pp. 637-644, (2020)
[8]  
Littman M L., Markov games as a framework for multi-agent reinforcement learning, Machine Learning Proceedings, pp. 157-163, (1994)
[9]  
Konda V, Tsitsiklis J., Actor-critic algorithms, SIAM Journal on Control and Optimization, 42, 4, pp. 1143-1166, (2003)
[10]  
Kraemer L, Banerjee B., Multi-agent reinforcement learning as a rehearsal for decentralized planning, Neurocomputing, 190, pp. 82-94, (2016)