[1]
Agogino A. K., Tumer K., Unifying temporal and structural credit assignment problems, Proceedings of the 3rd International Joint Conference on Autonomous Agents and Multiagent Systems, 2, pp. 980-987, (2004)
[2]
Agogino A., Tumer K., Multi-agent reward analysis for learning in noisy domains, Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS ’05), pp. 81-88, (2005)
[3]
Agogino A. K., Tumer K., Analyzing and visualizing multiagent rewards in dynamic and stochastic environments, Journal of Autonomous Agents and Multiagent Systems, pp. 320-338, (2008)
[4]
Benda M., On optimal cooperation of knowledge sources, (1985)
[5]
Berner C., Brockman G., Chan B., Cheung V., Debiak P., Dennison C., Farhi D., Fischer Q., Hashme S., Hesse C., Jozefowicz R., Gray S., Olsson C., Pachocki J., Petrov M., de Oliveira Pinto H. P., Raiman J., Salimans T., Schlatter J., Schneider J., Sidor S., Sutskever I., Tang J., Wolski F., Zhang S., Dota 2 with large scale deep reinforcement learning, CoRR, (2019)
[6]
Bhalla S., Ganapathi Subramanian S., Crowley M., Deep multi agent reinforcement learning for autonomous driving, Advances in Artificial Intelligence, pp. 67-78, (2020)
[7]
Busoniu L., Babuska R., De Schutter B., A comprehensive survey of multiagent reinforcement learning, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 38, 2, pp. 156-172, (2008)
[8]
Desjardins C., Chaib-draa B., Cooperative adaptive cruise control: A reinforcement learning approach, IEEE Transactions on Intelligent Transportation Systems, 12, pp. 1248-1260, (2012)
[9]
Devlin S., Yliniemi L., Kudenko D., Tumer K., Potential-based difference rewards for multiagent reinforcement learning, Proceedings of the 2014 International Conference on Autonomous Agents and Multi-agent Systems (AAMAS ’14), pp. 165-172, (2014)
[10]
Foerster J. N., Farquhar G., Afouras T., Nardelli N., Whiteson S., Counterfactual multi-agent policy gradients, CoRR, (2017)