Multi-objective multi-agent decision making: a utility-based analysis and survey

被引:56
作者
Radulescu, Roxana [1 ]
Mannion, Patrick [2 ]
Roijers, Diederik M. [1 ,3 ,4 ]
Nowe, Ann [1 ]
机构
[1] Vrije Univ Brussel, Artificial Intelligence Lab, Brussels, Belgium
[2] Natl Univ Ireland Galway, Sch Comp Sci, Galway, Ireland
[3] HU Univ Appl Sci Utrecht, Inst ICT, Utrecht, Netherlands
[4] Vrije Univ Amsterdam, Computat Intelligence, Amsterdam, Netherlands
关键词
Multi-agent systems; Multi-objective decision making; Multi-objective optimisation criteria; Solution concepts; Reinforcement learning; PARETO EQUILIBRIA; COMPREHENSIVE SURVEY; MULTICRITERIA; OPTIMIZATION; MANAGEMENT; NETWORKS; SYSTEMS; GAMES; NEGOTIATION; PERFORMANCE;
D O I
10.1007/s10458-019-09433-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The majority of multi-agent system implementations aim to optimise agents' policies with respect to a single objective, despite the fact that many real-world problem domains are inherently multi-objective in nature. Multi-objective multi-agent systems (MOMAS) explicitly consider the possible trade-offs between conflicting objective functions. We argue that, in MOMAS, such compromises should be analysed on the basis of the utility that these compromises have for the users of a system. As is standard in multi-objective optimisation, we model the user utility using utility functions that map value or return vectors to scalar values. This approach naturally leads to two different optimisation criteria: expected scalarised returns (ESR) and scalarised expected returns (SER). We develop a new taxonomy which classifies multi-objective multi-agent decision making settings, on the basis of the reward structures, and which and how utility functions are applied. This allows us to offer a structured view of the field, to clearly delineate the current state-of-the-art in multi-objective multi-agent decision making approaches and to identify promising directions for future research. Starting from the execution phase, in which the selected policies are applied and the utility for the users is attained, we analyse which solution concepts apply to the different settings in our taxonomy. Furthermore, we define and discuss these solution concepts under both ESR and SER optimisation criteria. We conclude with a summary of our main findings and a discussion of many promising future research directions in multi-objective multi-agent systems.
引用
收藏
页数:52
相关论文
共 194 条
  • [1] Abels A, 2019, PR MACH LEARN RES, V97
  • [2] Ahmad I, 2008, 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, P2645
  • [3] Autonomous agents modelling other agents: A comprehensive survey and open problems
    Albrecht, Stefano V.
    Stone, Peter
    [J]. ARTIFICIAL INTELLIGENCE, 2018, 258 : 66 - 95
  • [4] Aleksandrov M, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P42
  • [5] Learning in multi-agent systems
    Alonso, E
    D'Inverno, M
    Kudenko, D
    Luck, M
    Noble, J
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2001, 16 (03) : 277 - 284
  • [6] Altman E., 1999, Constrained Markov decision processes, V7
  • [7] [Anonymous], 2007, P 6 INT JOINT C AUT
  • [8] [Anonymous], MULTICRITERIA NEGOTI
  • [9] [Anonymous], 2015, THESIS
  • [10] [Anonymous], 2018, ADV NEURAL INFORM PR