共 120 条
[41]
Zhou H, Lan T, Aggarwal V., PAC: Assisted value factorization with counterfactual predictions in multi-agent reinforcement learning, Proceedings of the 35th Conference on Neural Information Processing Systems, (2022)
[42]
Shen S, Qiu M, Liu J, Et al., ResQ: A residual Q function-based approach for multi-agent reinforcement learning value factorization, Proceedings of the 35th Conference on Neural Information Processing Systems, pp. 5471-5483, (2022)
[43]
Hong Y, Jin Y, Tang Y., Rethinking individual global max in cooperative multi-agent reinforcement learning, Proceedings of the 35th Conference on Neural Information Processing Systems, (2022)
[44]
Yang Y, Luo R, Li M, Et al., Mean field multi-agent reinforcement learning, Proceedings of the 35th International Conference on Machine Learning, pp. 5571-5580, (2018)
[45]
Subramanian S G, Poupart P, Taylor M E, Et al., Multi type mean field reinforcement learning, (2020)
[46]
Zhang T, Ye Q, Bian J, Et al., MFVFD: A multi-agent Q-learning approach to cooperative and non-cooperative tasks, Proceedings of the 30th International Joint Conference on Artificial Intelligence, pp. 500-506, (2021)
[47]
Subramanian SG, Taylor M E, Crowley M, Et al., Decentralized mean field games, Proceedings of the AAAI Conference on Artificial Intelligence, pp. 9439-9447, (2022)
[48]
Ding S, Du W, Ding L, Et al., Multi-agent dueling Q-learning with mean field and value decomposition, Pattern Recognition, 139, (2023)
[49]
Yang M, Liu G, Zhou Z., Partially observable mean field multi-agent reinforcement learning based on graph-attention, (2023)
[50]
Foerster J, Assael I A, De Freitas N, Et al., Learning to communicate with deep multi-agent reinforcement learning, Proceedings of the 29th Conference on Neural Information Processing Systems, pp. 2145-2153, (2016)