Towards possibilistic reinforcement learning algorithms

被引:0
|
作者
Sabbadin, R [1 ]
机构
[1] INRA, Unite Biometrie & Intelligence Artificielle, F-31329 Castanet Tolosan, France
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a framework and algorithms for reinforcement learning in sequential decision problems under uncertainty in which the rewards are qualitative, and/or am temporarily aggregated by a "minimum" instead of a sum as in the classical Markov Decision Processes (MDP) framework. The framework is based on a "possibilistic" version of Markov Decision Processes and the learning algorithms are based on indirect methods in which the possibilistic model of the problem is learned while the problem itself is solved, using Dynamic Prong.
引用
收藏
页码:404 / 407
页数:4
相关论文
共 50 条
  • [1] Improving reinforcement learning algorithms: Towards optimal learning rate policies
    Mounjid, Othmane
    Lehalle, Charles-Albert
    MATHEMATICAL FINANCE, 2024, 34 (02) : 588 - 621
  • [2] A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations
    Liu, Qisai
    Lee, Xian Yeow
    Sarkar, Soumik
    AI OPEN, 2024, 5 : 126 - 141
  • [3] Evolutionary algorithms for reinforcement learning
    Moriarty, DE
    Schultz, AC
    Grefenstette, JJ
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 241 - 276
  • [4] Evolutionary Algorithms for Reinforcement Learning
    Moriarty, David E.
    Schultz, Alan C.
    Grefenstette, John J.
    Journal of Artificial Intelligence Research, 1999, 11 (00): : 241 - 276
  • [5] Ensemble algorithms in reinforcement learning
    Wiering, Marco A.
    van Hasselt, Hado
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 930 - 936
  • [6] REINFORCEMENT LEARNING ALGORITHMS IN ROBOTICS
    Bocsi, Botond
    Csato, Lehel
    KEPT 2011: KNOWLEDGE ENGINEERING PRINCIPLES AND TECHNIQUES, 2011, : 131 - 142
  • [7] REINFORCEMENT LEARNING - ARCHITECTURES AND ALGORITHMS
    KOKAR, MM
    REVELIOTIS, SA
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1993, 8 (08) : 875 - 894
  • [8] Aggregation of reinforcement learning algorithms
    Jiang, Ju
    Kamel, Mohamed S.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 68 - +
  • [9] Convergence Theorems of Possibilistic Clustering Algorithms and Generalized Possibilistic Clustering Algorithms
    Lin, Qihang
    Zhou, Jian
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION AND MANAGEMENT SCIENCES, 2009, 8 : 950 - 957
  • [10] Evaluating Product-Based Possibilistic Networks Learning Algorithms
    Haddad, Maroua
    Leray, Philippe
    Ben Amor, Nahla
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2015, 2015, 9161 : 312 - 321