共 27 条
[1]
Gao Y, Chen S F, Lu X., Research on reinforcement learning technology: A review, Acta Automatica Sinica, 30, 1, pp. 86-100, (2004)
[2]
Arulkumaran K, Deisenroth M P, Brundage M, Et al., Deep reinforcement learning: A brief survey, IEEE Signal Processing Magazine, 34, 6, pp. 26-38, (2017)
[3]
Zhao Z H, Gao Y, Luo B, Et al., Reinforcement learning technology in multi-agent system, Computer Science, 31, 3, pp. 23-27, (2004)
[4]
Anderson B D O, Yu C B, Fidan B, Et al., Rigid graph control architectures for autonomous formations, IEEE Control Systems Magazine, 28, 6, pp. 48-63, (2008)
[5]
Hernandez-Leal P, Kaisers M, Baarslag T, Et al., A survey of learning in multiagent environments: Dealing with non-stationarity, (2017)
[6]
Matignon L, Laurent G J, Le Fort-Piat N., Independent reinforcement learners in cooperative Markov games: A survey regarding coordination problems, The Knowledge Engineering Review, 27, 1, pp. 1-31, (2012)
[7]
Zhang J, Pan Y Z, Yang H T, Et al., Multi-agent decision making using Monte Carlo Q-value function, Control and Decision, 35, 3, pp. 637-644, (2020)
[8]
Littman M L., Markov games as a framework for multi-agent reinforcement learning, Machine Learning Proceedings, pp. 157-163, (1994)
[9]
Konda V, Tsitsiklis J., Actor-critic algorithms, SIAM Journal on Control and Optimization, 42, 4, pp. 1143-1166, (2003)
[10]
Kraemer L, Banerjee B., Multi-agent reinforcement learning as a rehearsal for decentralized planning, Neurocomputing, 190, pp. 82-94, (2016)