Efficient Policies for Stationary Possibilistic Markov Decision Processes

被引：1

作者：

Ben Amor, Nahla ^{[1
]}

El Khalfi, Zeineb ^{[1
,2
]}

Fargier, Helene ^{[2
]}

Sabaddin, Regis ^{[3
]}

机构：

[1] LARODEC, Le Bardo, Tunisia

[2] IRIT, Toulouse, France

[3] INRA, MIAT, Toulouse, France

来源：

SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2017 | 2017年 / 10369卷

关键词：

Markov Decision Process; Possibility theory; Lexicographic comparisons; Possibilistic qualitative utilities;

D O I：

10.1007/978-3-319-61581-3_28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Possibilistic Markov Decision Processes offer a compact and tractable way to represent and solve problems of sequential decision under qualitative uncertainty. Even though appealing for its ability to handle qualitative problems, this model suffers from the drowning effect that is inherent to possibilistic decision theory. The present paper proposes to escape the drowning effect by extending to stationary possibilistic MDPs the lexicographic preference relations defined in [6] for nonsequential decision problems and provides a value iteration algorithm to compute policies that are optimal for these new criteria.

引用

页码：306 / 317

页数：12

共 18 条

[1] [Anonymous], P UAI 13
[2] [Anonymous], 2011, P INT C AUTOMATED PL
[3] [Anonymous], P UAI 2005
[4] [Anonymous], 2020, Reinforcement Learning, An Introduction
[5] Anytime Algorithms for Solving Possibilistic MDPs and Hybrid MDPs
Bauters, Kim
Liu, Weiru
Godo, Lluis
[J]. FOUNDATIONS OF INFORMATION AND KNOWLEDGE SYSTEMS (FOIKS 2016), 2016, 9616 : 24 - 41
[6] A MARKOVIAN DECISION PROCESS
BELLMAN, R
[J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
[7] Lexicographic Refinements in Possibilistic Decision Trees
Ben Amor, Nahla
El Khalfi, Zeineb
Fargier, Helene
Sabbadin, Regis
[J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 202 - 208
[8] Dubois D., 1995, Proceedings of the 14th International Joint Conference on Artifcial Intelligence IJCAI'95, P1925
[9] Qualitative decision under uncertainty: back to expected utility
Fargier, H
Sabbadin, W
[J]. ARTIFICIAL INTELLIGENCE, 2005, 164 (1-2) : 245 - 280
[10] Gilbert H., 2016, P JMLR 2016, V60, P1

← 1 2 →