Efficient Policies for Stationary Possibilistic Markov Decision Processes

被引:1
作者
Ben Amor, Nahla [1 ]
El Khalfi, Zeineb [1 ,2 ]
Fargier, Helene [2 ]
Sabaddin, Regis [3 ]
机构
[1] LARODEC, Le Bardo, Tunisia
[2] IRIT, Toulouse, France
[3] INRA, MIAT, Toulouse, France
来源
SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2017 | 2017年 / 10369卷
关键词
Markov Decision Process; Possibility theory; Lexicographic comparisons; Possibilistic qualitative utilities;
D O I
10.1007/978-3-319-61581-3_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Possibilistic Markov Decision Processes offer a compact and tractable way to represent and solve problems of sequential decision under qualitative uncertainty. Even though appealing for its ability to handle qualitative problems, this model suffers from the drowning effect that is inherent to possibilistic decision theory. The present paper proposes to escape the drowning effect by extending to stationary possibilistic MDPs the lexicographic preference relations defined in [6] for nonsequential decision problems and provides a value iteration algorithm to compute policies that are optimal for these new criteria.
引用
收藏
页码:306 / 317
页数:12
相关论文
共 18 条
  • [1] [Anonymous], P UAI 13
  • [2] [Anonymous], 2011, P INT C AUTOMATED PL
  • [3] [Anonymous], P UAI 2005
  • [4] [Anonymous], 2020, Reinforcement Learning, An Introduction
  • [5] Anytime Algorithms for Solving Possibilistic MDPs and Hybrid MDPs
    Bauters, Kim
    Liu, Weiru
    Godo, Lluis
    [J]. FOUNDATIONS OF INFORMATION AND KNOWLEDGE SYSTEMS (FOIKS 2016), 2016, 9616 : 24 - 41
  • [6] A MARKOVIAN DECISION PROCESS
    BELLMAN, R
    [J]. JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05): : 679 - 684
  • [7] Lexicographic Refinements in Possibilistic Decision Trees
    Ben Amor, Nahla
    El Khalfi, Zeineb
    Fargier, Helene
    Sabbadin, Regis
    [J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 202 - 208
  • [8] Dubois D., 1995, Proceedings of the 14th International Joint Conference on Artifcial Intelligence IJCAI'95, P1925
  • [9] Qualitative decision under uncertainty: back to expected utility
    Fargier, H
    Sabbadin, W
    [J]. ARTIFICIAL INTELLIGENCE, 2005, 164 (1-2) : 245 - 280
  • [10] Gilbert H., 2016, P JMLR 2016, V60, P1