Control Theory Meets POMDPs: A Hybrid Systems Approach

被引:2
|
作者
Ahmadi, Mohamadreza [1 ]
Jansen, Nils [2 ]
Wu, Bo [3 ]
Topcu, Ufuk [3 ]
机构
[1] CALTECH, Ctr Autonomous Syst & Technol, Pasadena, CA 91125 USA
[2] Radboud Univ Nijmegen, Inst Comp & Informat Sci, Dept Software Sci, NL-6500 GL Nijmegen, Netherlands
[3] Univ Texas Austin, Oden Inst Computat Engn & Sci, Austin, TX 78712 USA
关键词
Safety; Decision making; Markov processes; Kalman filters; Control theory; Bayes methods; Switched systems; Artificial intelligence; autonomous systems; control theory; Lyapunov methods; MARKOV-PROCESSES; OPTIMIZATION; APPROXIMATIONS; STABILITY;
D O I
10.1109/TAC.2020.3035755
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partially observable Markov decision processes (POMDPs) provide a modeling framework for a variety of sequential decision making under uncertainty scenarios in artificial intelligence (AI). Since the states are not directly observable in a POMDP, decision making has to be performed based on the output of a Bayesian filter (continuous beliefs); hence, making POMDPs intractable to solve and analyze. To overcome the complexity challenge of POMDPs, we apply techniques from the control theory. Our contributions are fourfold. 1) We begin by casting the problem of analyzing a POMDP into analyzing the behavior of a discrete-time switched system. 2) Then, in order to estimate the reachable belief space of a POMDP, i.e., the set of all possible evolutions given an initial belief distribution over the states and a set of actions and observations, we find overapproximations in terms of sublevel sets of Lyapunov-like functions. 3) Furthermore, in order to verify safety and performance requirements of a given POMDP, we formulate a barrier certificate theorem, wherein we show that if there exists a barrier certificate satisfying a set of inequalities along the solutions to the belief update equation of the POMDP, the safety and performance properties are guaranteed to hold. In both cases 2) and 3), the calculations can be decomposed and solved in parallel. 4) Finally, we show that the conditions we formulate can be computationally implemented as a set of sum-of-squares programs. We illustrate the applicability of our method by addressing two problems in active ad scheduling and machine teaching.
引用
收藏
页码:5191 / 5204
页数:14
相关论文
共 50 条
  • [41] Fejer Polynomials and Control of Nonlinear Discrete Systems
    Dmitrishin, D.
    Hagelstein, P.
    Khamitova, A.
    Korenovskyi, A.
    Stokolos, A.
    CONSTRUCTIVE APPROXIMATION, 2020, 51 (02) : 383 - 412
  • [42] Stability theory for hybrid dynamical systems
    Ye, H
    Michel, AN
    Hou, L
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1998, 43 (04) : 461 - 474
  • [43] Modal specifications for the control theory of discrete event systems
    Feuillade, Guillaume
    Pinchinat, Sophie
    DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2007, 17 (02): : 211 - 232
  • [44] Modal Specifications for the Control Theory of Discrete Event Systems
    Guillaume Feuillade
    Sophie Pinchinat
    Discrete Event Dynamic Systems, 2007, 17 : 211 - 232
  • [45] Hybrid Optimal Control Approach to Commercial Aircraft Trajectory Planning
    Soler, M.
    Olivares, A.
    Staffetti, E.
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2010, 33 (03) : 985 - 991
  • [46] Theory and applications of HVAC control systems - A review of model predictive control (MPC)
    Afram, Abdul
    Janabi-Sharifi, Farrokh
    BUILDING AND ENVIRONMENT, 2014, 72 : 343 - 355
  • [47] MULTILINEAR CONTROL SYSTEMS THEORY
    Chen, Can
    Surana, Amit
    Bloch, Anthony M.
    Rajapakse, Indika
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2021, 59 (01) : 749 - 776
  • [48] SYMBIOTIC SIMULATION SYSTEM: HYBRID SYSTEMS MODEL MEETS BIG DATA ANALYTICS
    Onggo, Bhakti Stephan
    Mustafee, Navonil
    Smart, Andi
    Juan, Angel A.
    Molloy, Owen
    2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 1358 - 1369
  • [49] Sliding mode control of switched hybrid systems with stochastic perturbation
    Wu, Ligang
    Ho, Daniel W. C.
    Li, C. W.
    SYSTEMS & CONTROL LETTERS, 2011, 60 (08) : 531 - 539
  • [50] Timed Petri Nets in Hybrid Systems: Stability and Supervisory Control
    Xenofon D. Koutsoukos
    Kevin X. He
    Michael D. Lemmon
    Panos J. Antsaklis
    Discrete Event Dynamic Systems, 1998, 8 : 137 - 173