A Survey of Multi-Objective Sequential Decision-Making

被引:357
作者
Roijers, Diederik M. [1 ]
Vamplew, Peter [2 ]
Whiteson, Shimon [1 ]
Dazeley, Richard [2 ]
机构
[1] Univ Amsterdam, Inst Informat, Amsterdam, Netherlands
[2] Univ Ballarat, Sch Sci Informat Technol & Engn, Ballarat, Vic 3353, Australia
关键词
MANY-OBJECTIVE OPTIMIZATION; OBSERVABLE MARKOV-PROCESSES; MULTI-POLICY OPTIMIZATION; INFINITE-HORIZON; REINFORCEMENT; ITERATION; UNCERTAINTY; ALGORITHM; NETWORKS; MODELS;
D O I
10.1613/jair.3987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
引用
收藏
页码:67 / 113
页数:47
相关论文
共 144 条
[1]  
Aberdeen D., 2004, Proc. ICAPS, V14, P402
[2]   Dynamic scheduling of maintenance tasks in the petroleum industry: A reinforcement approach [J].
Aissani, N. ;
Beldjilali, B. ;
Trentesaux, D. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2009, 22 (07) :1089-1103
[3]  
Aissani N., 2008, Proceedings of the 7th international conference mosim, paris, P698
[4]  
[Anonymous], 2002, Evolutionary algorithms for solving multi-objective problems
[5]  
[Anonymous], 2002, IFAC WORKSH MOD CONT
[6]  
[Anonymous], 1999, STOCH MODEL SER, DOI 10.1201/9781315140223
[7]  
[Anonymous], 2001, NIPS
[8]  
[Anonymous], 1971, Unpublished PhD dissertation
[9]  
[Anonymous], 2001, P ICML
[10]  
[Anonymous], 2012, AAAI