Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning

被引：0

作者：

Zimmer, Matthieu ^{[1
]}

Glanois, Claire ^{[1
]}

Siddique, Umer ^{[1
]}

Weng, Paul ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, UM SJTU Joint Inst, Shanghai, Peoples R China

[2] Shanghai Jiao Tong Univ, Dept Automat, Shanghai, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139 | 2021年 / 139卷

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider the problem of learning fair policies in (deep) cooperative multi-agent reinforcement learning (MARL). We formalize it in a principled way as the problem of optimizing a welfare function that explicitly encodes two important aspects of fairness: efficiency and equity. We provide a theoretical analysis of the convergence of policy gradient for this problem. As a solution method, we propose a novel neural network architecture, which is composed of two sub-networks specifically designed for taking into account these two aspects of fairness. In experiments, we demonstrate the importance of the two sub-networks for fair optimization. Our overall approach is general as it can accommodate any (sub)differentiable welfare function. Therefore, it is compatible with various notions of fairness that have been proposed in the literature (e.g., lexicographic max-imin, generalized Gini social welfare function, proportional fairness). Our method is generic and can be implemented in various MARL settings: centralized training and decentralized execution, or fully decentralized. Finally, we experimentally validate our approach in various domains and show that it can perform much better than previous methods, both in terms of efficiency and equity.

引用

页数：12

共 50 条

[31] Scalable Reinforcement Learning Policies for Multi-Agent Control
Hsu, Christopher D.
Jeong, Heejin
Pappas, George J.
Chaudhari, Pratik
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4785 - 4791
[32] Training Cooperative Agents for Multi-Agent Reinforcement Learning
Bhalla, Sushrut
Subramanian, Sriram G.
Crowley, Mark
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1826 - 1828
[33] Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Liu, Iou-Jen
Jain, Unnat
Yeh, Raymond A.
Schwing, Alexander G.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[34] Reinforcement learning of coordination in cooperative multi-agent systems
Kapetanakis, S
Kudenko, D
EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 326 - 331
[35] Centralized reinforcement learning for multi-agent cooperative environments
Lu, Chengxuan
Bao, Qihao
Xia, Shaojie
Qu, Chongxiao
EVOLUTIONARY INTELLIGENCE, 2024, 17 (01) : 267 - 273
[36] Cooperative multi-agent game based on reinforcement learning
Liu, Hongbo
HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
[37] Pacesetter Learning for Large Scale Cooperative Multi-Agent Reinforcement Learning
Zhou, Pingqi
Li, Chao
Qiu, Mengwei
Liu, Jun
Ma, Chennan
Yan, Ming
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 115 - 126
[38] Learning Distinct Strategies for Heterogeneous Cooperative Multi-agent Reinforcement Learning
Wan, Kejia
Xu, Xinhai
Li, Yuan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 544 - 555
[39] QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement learning
Son, Kyunghwan
Kim, Daewoo
Kang, Wan Ju
Hostallero, David
Yi, Yung
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[40] Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning
Zhou, Meng
Liu, Ziyu
Sui, Pengwei
Li, Yixuan
Chung, Yuk Ying
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33

← 1 2 3 4 5 →