Multi-Agent Adversarial Inverse Reinforcement Learning

被引：0

作者：

Yu, Lantao ^{[1
]}

Song, Jiaming ^{[1
]}

Ermon, Stefano ^{[1
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97 | 2019年 / 97卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning agents are prone to undesired behaviors due to reward mis-specification. Finding a set of reward functions to properly guide agent behaviors is particularly challenging in multi-agent scenarios. Inverse reinforcement learning provides a framework to automatically acquire suitable reward functions from expert demonstrations. Its extension to multi-agent settings, however, is difficult due to the more complex notions of rational behaviors. In this paper, we propose MA-AIRL, a new framework for multi-agent inverse reinforcement learning, which is effective and scalable for Markov games with high-dimensional state-action space and unknown dynamics We derive our algorithm based on a new solution concept and maximum pseudolikelihood estimation within an adversarial reward learning framework. In the experiments, we demonstrate that MA-AIRL can recover reward functions that are highly correlated with ground truth ones, and significantly outperforms prior methods in terms of policy imitation.

引用

页数：8

共 50 条

[41] Learning to Communicate with Deep Multi-Agent Reinforcement Learning
Foerster, Jakob N.
Assael, Yannis M.
de Freitas, Nando
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
[42] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
[43] Concept Learning for Interpretable Multi-Agent Reinforcement Learning
Zabounidis, Renos
Campbell, Joseph
Stepputtis, Simon
Hughes, Dana
Sycara, Katia
CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1828 - 1837
[44] Learning structured communication for multi-agent reinforcement learning
Sheng, Junjie
Wang, Xiangfeng
Jin, Bo
Yan, Junchi
Li, Wenhao
Chang, Tsung-Hui
Wang, Jun
Zha, Hongyuan
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
[45] Learning structured communication for multi-agent reinforcement learning
Junjie Sheng
Xiangfeng Wang
Bo Jin
Junchi Yan
Wenhao Li
Tsung-Hui Chang
Jun Wang
Hongyuan Zha
Autonomous Agents and Multi-Agent Systems, 2022, 36
[46] Generalized learning automata for multi-agent reinforcement learning
De Hauwere, Yann-Michael
Vrancx, Peter
Nowe, Ann
AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
[47] Multi-agent Reinforcement Learning Aided Sampling Algorithms for a Class of Multiscale Inverse Problems
Chung, Eric
Leung, Wing Tat
Pun, Sai-Mang
Zhang, Zecheng
JOURNAL OF SCIENTIFIC COMPUTING, 2023, 96 (02)
[48] Multi-agent Inverse Reinforcement Learning for Certain General-Sum Stochastic Games
Lin, Xiaomin
Adams, Stephen C.
Beling, Peter A.
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2019, 66 : 473 - 502
[49] Multi-agent Reinforcement Learning Aided Sampling Algorithms for a Class of Multiscale Inverse Problems
Eric Chung
Wing Tat Leung
Sai-Mang Pun
Zecheng Zhang
Journal of Scientific Computing, 2023, 96
[50] Multi-agent reinforcement learning for character control
Li, Cheng
Fussell, Levi
Komura, Taku
VISUAL COMPUTER, 2021, 37 (12): : 3115 - 3123

← 1 2 3 4 5 →