Research on multi-agent strategy based on filtering mechanism to filter information

Cited by: 0

Authors:

Chen L. [1]

Guo T. [1]

Liu Y.-T. [1]

Yang J.-M. [1]

Affiliation:

[1] School of Automation and Electrical Engineering, Shenyang Ligong University, Shenyang

Source:

Kongzhi yu Juece/Control and Decision | 2022, Vol. 37, No. 06

Keywords:

Centralized training decentralized execution; Filtering mechanism; Multi-agent system; Reinforcement learning

DOI:

10.13195/j.kzyjc.2020.1139

Abstract:

When the agents in a multi-agent system cooperate or compete, the joint information space grows and the efficiency with which agents extract information from one another falls. This paper adopts a multi-agent reinforcement learning strategy with a filtering mechanism (FMAC) to improve communication between agents. The method identifies the agents related to a given agent, computes each one's information contribution from its correlation, and filters out the information of irrelevant agents, enabling effective communication in cooperative, competitive, or mixed environments. Centralized training with decentralized execution is adopted to address the non-stationarity of the environment. Comparative experiments against other algorithms verify that the improved algorithm raises strategy iteration efficiency and generalization ability and remains stable as the number of agents increases, which favors applying multi-agent reinforcement learning in a wider range of fields. Copyright ©2022 Control and Decision.
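The filtering idea in the abstract (score related agents, weight their information by correlation, discard the irrelevant ones) can be illustrated with a minimal sketch. The abstract does not say how correlation is computed, so everything concrete here is an assumption for illustration only: the dot-product relevance score, the softmax weighting, the fixed `threshold`, and the hypothetical function name `filter_messages`. It is not the FMAC algorithm from the paper.

```python
import math


def filter_messages(own_query, other_keys, other_values, threshold=0.1):
    """Weight other agents' messages by relevance and drop irrelevant ones.

    Illustrative only: the scoring rule and threshold are assumptions,
    not details taken from the paper.
    """
    # Relevance score: dot product of this agent's query with each other agent's key.
    scores = [sum(q * k for q, k in zip(own_query, key)) for key in other_keys]

    # Softmax turns the scores into contribution weights that sum to 1.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Filter: zero out agents whose contribution is below the threshold.
    kept = [w if w >= threshold else 0.0 for w in weights]
    norm = sum(kept)
    if norm == 0.0:
        # Every agent was filtered out; return an all-zero message.
        return kept, [0.0] * len(other_values[0])
    kept = [w / norm for w in kept]

    # Aggregate the surviving messages into a single vector.
    dim = len(other_values[0])
    aggregated = [
        sum(w * value[d] for w, value in zip(kept, other_values))
        for d in range(dim)
    ]
    return kept, aggregated


# Three other agents; only the first is aligned with this agent's query.
weights, message = filter_messages(
    own_query=[1.0, 0.0],
    other_keys=[[5.0, 0.0], [0.0, 5.0], [-5.0, 0.0]],
    other_values=[[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]],
)
```

In a centralized-training, decentralized-execution setup, a vector like `message` would typically feed a critic during training only, while each agent's policy acts on its local observation at execution time.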

Pages: 1643-1648

Page count: 5
