An Aggregation Procedure for Large-Scale Markov Decision Processes

被引:0
|
作者
Bartl, Ondrej [1 ]
机构
[1] Univ Zilina, Fac Management Sci & Informat, Dept Software Technol, Zilina 01026, Slovakia
关键词
Markov/semi-Markov decision processes; state and action aggregation;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Markov decision models with a high state space cardinality may resist being computationally tractable. Then reduction in the number of possible state variable values can help. The fixed-weight aggregation procedure for large-scale Markov/semi-Markov decision processes is described in the paper. A possibility to approximate an original decision model by action aggregation accompanying state aggregation is mentioned as well.
引用
收藏
页码:9 / 15
页数:7
相关论文
共 50 条
  • [1] On State Aggregation to Approximate Complex Value Functions in Large-Scale Markov Decision Processes
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2011, 56 (02) : 333 - 344
  • [2] AN ITERATIVE AGGREGATION PROCEDURE FOR MARKOV DECISION-PROCESSES
    MENDELSSOHN, R
    OPERATIONS RESEARCH, 1982, 30 (01) : 62 - 73
  • [3] COMPUTATION TECHNIQUES FOR LARGE-SCALE UNDISCOUNTED MARKOV DECISION-PROCESSES
    HODGSON, TJ
    KOEHLER, GJ
    NAVAL RESEARCH LOGISTICS, 1979, 26 (04) : 587 - 594
  • [4] Sketched Newton Value Iteration for Large-Scale Markov Decision Processes
    Liu, Jinsong
    Xie, Chenghan
    Deng, Qi
    Ge, Dongdong
    Ye, Yinyu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13936 - 13944
  • [5] Inexact GMRES Policy Iteration for Large-Scale Markov Decision Processes
    Gargiani, Matilde
    Liao-McPherson, Dominic
    Zanelli, Andrea
    Lygeros, John
    IFAC PAPERSONLINE, 2023, 56 (02): : 11249 - 11254
  • [6] A hierarchical decision procedure for productivity innovation in large-scale petrochemical processes
    Han, Chonghun
    Kim, Minjin
    Yoon, En Sup
    COMPUTERS & CHEMICAL ENGINEERING, 2008, 32 (4-5) : 1029 - 1041
  • [7] An application of simulation for large-scale Markov decision processes to a problem in telephone network routing
    Zobel, C
    Scherer, WT
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2944 - 2955
  • [8] Simulation-based policy generation using large-scale Markov decision processes
    Zobel, CW
    Scherer, WT
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (06): : 609 - 622
  • [9] Faster saddle-point optimization for solving large-scale Markov decision processes
    Bas-Serrano, Joan
    Neu, Gergely
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 413 - 423
  • [10] Kernelized Q-Learning for Large-Scale, Potentially Continuous, Markov Decision Processes
    Sledge, Isaac J.
    Principe, Jose C.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018, : 153 - 162