共 20 条
- [1] Liang X X, Feng Y H, Ma Y, Et al., Deep multi-agent reinforcement learning: A survey, Acta Automatica Sinica, 46, 12, pp. 2537-2557, (2020)
- [2] Sun C Y, Mu C X., Important scientific problems of multi-agent deep reinforcement learning, Acta Automatica Sinica, 46, 7, pp. 1301-1312, (2020)
- [3] Cao Y C, Yu W W, Ren W, Et al., An overview of recent progress in the study of distributed multi-agent coordination, IEEE Transactions on Industrial Informatics, 9, 1, pp. 427-438, (2013)
- [4] Ye D, Zhang M, Yang Y., A multi-agent frame work for packet routing in wireless sensor networks, Sensors, 15, 5, pp. 10026-10047, (2015)
- [5] Huttenrauch M, Sosic A, Neumann G., Guided deep reinforcement learning for swarm systems, (2017)
- [6] Oliehoek F A, Amato C., Infinite-horizon decPOMDPs, A Concise Introduction to Decentralized POMDPs, pp. 69-77, (2016)
- [7] Lowe R, Wu Y, Tamar A, Et al., Multi-agent actor-critic for mixed cooperative-competitive environments, Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 6382-6393, (2017)
- [8] Wang Y H, Han B N, Wang T H, Et al., DOP: Off-policy multi-agent decomposed policy gradients, (2021)
- [9] Chen L, Liang C, Zhang J Y, Et al., A multi-agent reinforcement learning algorithm based on improved DDPG in actor-critic framework, Control and Decision, 36, 1, pp. 75-82, (2021)
- [10] Wang J H, Ren Z Z, Liu T, Et al., QPLEX: Duplex dueling multi-agent Q-learning[J/OL], (2021)