An Aggregation Procedure for Large-Scale Markov Decision Processes

被引:0
|
作者
Bartl, Ondrej [1 ]
机构
[1] Univ Zilina, Fac Management Sci & Informat, Dept Software Technol, Zilina 01026, Slovakia
关键词
Markov/semi-Markov decision processes; state and action aggregation;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
Markov decision models with a high state space cardinality may resist being computationally tractable. Then reduction in the number of possible state variable values can help. The fixed-weight aggregation procedure for large-scale Markov/semi-Markov decision processes is described in the paper. A possibility to approximate an original decision model by action aggregation accompanying state aggregation is mentioned as well.
引用
收藏
页码:9 / 15
页数:7
相关论文
共 50 条
  • [21] LARGE-SCALE PROCESSES ON THE SUN
    DOLGINOV, AZ
    IZVESTIYA AKADEMII NAUK SSSR SERIYA FIZICHESKAYA, 1983, 47 (09): : 1693 - 1694
  • [22] LARGE-SCALE WEATHER PROCESSES
    不详
    NATURE, 1956, 177 (4499) : 113 - 115
  • [23] New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system
    Ohno, Katsuhisa
    Boh, Toshitaka
    Nakade, Koichi
    Tamura, Takayoshi
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 249 (01) : 22 - 31
  • [24] LARGE-SCALE PROCESSES ON MOON
    FIELDER, G
    GEOPHYSICAL JOURNAL OF THE ROYAL ASTRONOMICAL SOCIETY, 1977, 49 (01): : 302 - 302
  • [25] Large-Scale Markov Decision Problems with KL Control Cost and its Application to Crowdsourcing
    Abbasi-Yadkori, Yasin
    Bartlett, Peter L.
    Chen, Xi
    Malek, Alan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1053 - 1062
  • [26] A METHODOLOGY FOR COMPUTATION REDUCTION FOR SPECIALLY STRUCTURED LARGE-SCALE MARKOV DECISION-PROBLEMS
    DING, FY
    HODGSON, TJ
    KING, RE
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1988, 34 (01) : 105 - 112
  • [27] A procedure for large-scale DEA computations
    Chen, Wen-Chih
    Cho, Wei-Jen
    COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (06) : 1813 - 1824
  • [28] A PROCEDURE FOR A LARGE-SCALE PREPARATION OF THIOPHOSGENE
    HORAK, J
    ZBIROVSK.M
    INDUSTRIE CHIMIQUE BELGE-BELGISCHE CHEMISCHE INDUSTRIE, 1966, 31 (SEP): : P141 - &
  • [29] A learning algorithm for Markov decision processes with adaptive state aggregation
    Baras, JS
    Borkar, VS
    PROCEEDINGS OF THE 39TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 2000, : 3351 - 3356
  • [30] A parallel solver for large-scale Markov chains
    Benzi, M
    Tuma, M
    APPLIED NUMERICAL MATHEMATICS, 2002, 41 (01) : 135 - 153