Reinforcement Learning for Stochastic Max-Plus Linear Systems

Cited by: 0
Authors
Subramanian, Vignesh [1]
Farhadi, Farzaneh [2]
Soudjani, Sadegh [3]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Newcastle Univ, Sch Engn, Newcastle Upon Tyne, Tyne & Wear, England
[3] Newcastle Univ, Sch Comp, Newcastle Upon Tyne, Tyne & Wear, England
Source
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
DISCRETE-EVENT SYSTEMS; REACHABILITY ANALYSIS;
DOI
10.1109/CDC49753.2023.10384207
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
This paper studies the design of control policies for Discrete Event Systems under uncertainties. We capture the timing of the events using the framework of max-plus-linear systems in which the time between consecutive events depends on random delays with unknown distributions. Our policy synthesis approach is with respect to a cost function, and it can be extended directly to satisfy safety specifications on the timing of events. The main novelty of our approach is to translate the system evolution to a Markov decision process (MDP) that has an uncountable state space and develop a stochastic optimisation problem under the evolution of the MDP. To tackle the unknown distribution of uncertainties (thus unknown transition probabilities in the MDP), we employ model-free reinforcement learning to perform optimisations and find control policies for the system. Our implementation results on the 9-dimensional model of a railway network show superiority of our learning approach in comparison with the stochastic model predictive control approach.
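As a rough illustration of the max-plus-linear dynamics mentioned in the abstract, the sketch below simulates event times under the update x(k+1) = A(k) ⊗ x(k), where (A ⊗ x)_i = max_j (A_ij + x_j) and each finite entry of A(k) is a nominal delay plus a random disturbance. Everything concrete here is an assumption made for illustration: the 2-dimensional matrix `A_NOMINAL`, the uniform noise, and the function names are hypothetical and are not taken from the paper, whose case study is a 9-dimensional railway model with unknown delay distributions. Control inputs and the reinforcement learning step are omitted.

```python
import random

NEG_INF = float("-inf")  # max-plus "zero": no dependency between two events


def maxplus_matvec(A, x):
    """Max-plus product: (A ⊗ x)_i = max_j (A[i][j] + x[j])."""
    return [max(a_ij + x_j for a_ij, x_j in zip(row, x)) for row in A]


def sample_matrix(A_nominal, rng, max_delay):
    """Add a random non-negative delay to every finite entry.

    The uniform distribution is an assumption used only to generate samples;
    in the setting of the abstract the distribution is unknown.
    """
    return [
        [a if a == NEG_INF else a + rng.uniform(0.0, max_delay) for a in row]
        for row in A_nominal
    ]


def simulate(A_nominal, x0, steps, max_delay=1.0, seed=0):
    """Return the list of event-time vectors x(0), ..., x(steps)."""
    rng = random.Random(seed)
    trajectory = [list(x0)]
    for _ in range(steps):
        A_k = sample_matrix(A_nominal, rng, max_delay)
        trajectory.append(maxplus_matvec(A_k, trajectory[-1]))
    return trajectory


# Hypothetical 2-dimensional example (not the paper's railway network).
A_NOMINAL = [[2.0, 5.0], [3.0, 3.0]]
trajectory = simulate(A_NOMINAL, [0.0, 0.0], steps=5)
```

Each vector in `trajectory` holds the times at which the two events occur for the k-th time. A model-free learner would only observe such sampled transitions, never the delay distribution itself.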
Pages: 5631 - 5638
Number of pages: 8
Related papers
50 items in total
  • [21] On the control of max-plus linear system subject to state restriction
    Maia, C. A.
    Andrade, C. R.
    Hardouin, L.
    AUTOMATICA, 2011, 47 (05) : 988 - 992
  • [22] SMT-Based Reachability Analysis of High Dimensional Interval Max-Plus Linear Systems
    Mufid, Muhammad Syifa'ul
    Adzkiya, Dieky
    Abate, Alessandro
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, 67 (06) : 2700 - 2714
  • [23] Comparing Disjunctive and Concise Approaches for Set-Guaranteed Estimation in Max-Plus Linear Systems
    Espindola-Winck, Guilherme
    Hardouin, Laurent
    Lhommeau, Mehdi
    IFAC PAPERSONLINE, 2024, 58 (01) : 36 - 41
  • [24] Efficient State-Estimation of Uncertain Max-Plus Linear Systems with High Observation Noise
    Espindola-Winck, Guilherme
    Candido, Renato Markele Ferreira
    Hardouin, Laurent
    Lhommeau, Mehdi
    IFAC PAPERSONLINE, 2022, 55 (28) : 228 - 235
  • [25] Analysis of Decision Stochastic Discrete-Event Systems Aggregating Max-Plus Algebra and Markov Chain
    Ribeiro, G. R.
    Saldanha, R. R.
    Maia, C. A.
    JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2018, 29 (05) : 576 - 585
  • [26] Modeling and scheduling of production systems by using max-plus algebra
    Al Bermanei, Hazem
    Boling, Jari M.
    Hognas, Goeran
    FLEXIBLE SERVICES AND MANUFACTURING JOURNAL, 2024, 36 (01) : 129 - 150
  • [27] Modeling and scheduling of production systems by using max-plus algebra
    Hazem Al Bermanei
    Jari M. Böling
    Göran Högnäs
    Flexible Services and Manufacturing Journal, 2024, 36 : 129 - 150
  • [28] Toward the Application of a Critical-Chain-Project-Management-based Framework on Max-plus Linear Systems
    Takahashi, Hirotaka
    Goto, Hiroyuki
    Kasahara, Munenori
    INDUSTRIAL ENGINEERING AND MANAGEMENT SYSTEMS, 2009, 8 (03) : 155 - 161
  • [29] The set of realizations of a max-plus linear sequence is semi-polyhedral
    Blondel, Vincent
    Gaubert, Stephane
    Portier, Natacha
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2011, 77 (04) : 820 - 833
  • [30] On the Existence of Simulations for Max-Plus Automata
    Daviaud, Berangere
    Lahaye, Sebastien
    Lhommeau, Mehdi
    Komenda, Jan
    IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 694 - 699