A state-space representation model and learning algorithm for real-time decision-making under uncertainty

被引:0
|
作者
Malikopoulos, Andreas A. [1 ]
Assanis, Dennis N. [1 ]
Papalambros, Panos Y. [1 ]
机构
[1] Univ Michigan, Dept Mech Engn, Ann Arbor, MI 48109 USA
来源
PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINERING CONGRESS AND EXPOSITION 2007, VOL 9, PTS A-C: MECHANICAL SYSTEMS AND CONTROL | 2008年
关键词
sequential decision-making under uncertainty; Markov Decision Process (MDP); reinforcement learning; learning algorithms; inverted pendulum; vehicle cruise control;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modeling dynamic systems incurring stochastic disturbances for deriving a control policy is a ubiquitous task in engineering. However, in some instances obtaining a model of a system may be impractical or impossible. Alternative approaches have been developed using a simulation-based stochastic framework, in which the system interacts with its environment in real time and obtains information that can be processed to produce an optimal control policy. In this context, the problem of developing a policy for controlling the system's behavior is formulated as a sequential decision-making problem under uncertainty. This paper considers real-time sequential decision-making under uncertainty modeled as a Markov Decision Process (MDP). A state-space representation model is constructed through a learning mechanism and is used to improve system performance over time. The model allows decision making based on gradually enhanced knowledge of system response as it transitions from one state to another, in conjunction with actions taken at each state. A learning algorithm is implemented realizing in real time the optimal control policy associated with the state transitions. The proposed method is demonstrated on the single cart-pole balancing problem and a vehicle cruise control problem.
引用
收藏
页码:575 / 584
页数:10
相关论文
共 50 条
  • [1] A Real-Time Computational Learning Model for Sequential Decision-Making Problems Under Uncertainty
    Malikopoulos, Andreas A.
    Papalambros, Panos Y.
    Assanis, Dennis N.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2009, 131 (04): : 1 - 8
  • [2] Real-Time Planning as Decision-Making under Uncertainty
    Mitchell, Andrew
    Ruml, Wheeler
    Spaniol, Fabian
    Hoffmann, Joerg
    Petrik, Marek
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2338 - 2345
  • [3] Real-time data stream learning for emergency decision-making under uncertainty
    Wang, Kun
    Xiong, Li
    Xue, Rudan
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2024, 633
  • [4] Constrained optimization under uncertainty for decision-making problems: Application to Real-Time Strategy games
    Antuori, Valentin
    Richoux, Florian
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 458 - 465
  • [5] A Bayesian approach to decision-making under uncertainty: An application to real-time forecasting in the river Rhine
    Reggiani, P.
    Weerts, A. H.
    JOURNAL OF HYDROLOGY, 2008, 356 (1-2) : 56 - 69
  • [6] Representation of real-time decision-making by contextual graphs based simulation
    Brezillon, Patrick
    Aroua, Anissa
    JOURNAL OF DECISION SYSTEMS, 2013, 22 (01) : 28 - 42
  • [7] Real-time Monitoring of Dynamic Traffic States by State-Space Model
    Kawasaki, Yosuke
    Hara, Yusuke
    Kuwahara, Masao
    INTERNATIONAL SYMPOSIA OF TRANSPORT SIMULATION (ISTS) AND THE INTERNATIONAL WORKSHOP ON TRAFFIC DATA COLLECTION AND ITS STANDARDIZATION (IWTDCS): ADVANCED TRANSPORT SIMULATION MODELLING BASED ON BIG DATA, 2017, 21 : 42 - 55
  • [8] REAL-TIME DATA-PROCESSING AND REAL-TIME DECISION-MAKING
    KENNEDY, MH
    HOFFER, JA
    JOURNAL OF SYSTEMS MANAGEMENT, 1978, 29 (10): : 21 - 25
  • [9] SUPPORTING COMPLEX REAL-TIME DECISION-MAKING THROUGH MACHINE LEARNING
    CHATURVEDI, AR
    HUTCHINSON, GK
    NAZARETH, DL
    DECISION SUPPORT SYSTEMS, 1993, 10 (02) : 213 - 233
  • [10] Machine learning process for real-time decision-making in animal husbandry
    de Alencar Naas, I.
    JOURNAL OF ANIMAL SCIENCE, 2018, 96 : 500 - 501