Control Theory Meets POMDPs: A Hybrid Systems Approach

被引:2
|
作者
Ahmadi, Mohamadreza [1 ]
Jansen, Nils [2 ]
Wu, Bo [3 ]
Topcu, Ufuk [3 ]
机构
[1] CALTECH, Ctr Autonomous Syst & Technol, Pasadena, CA 91125 USA
[2] Radboud Univ Nijmegen, Inst Comp & Informat Sci, Dept Software Sci, NL-6500 GL Nijmegen, Netherlands
[3] Univ Texas Austin, Oden Inst Computat Engn & Sci, Austin, TX 78712 USA
关键词
Safety; Decision making; Markov processes; Kalman filters; Control theory; Bayes methods; Switched systems; Artificial intelligence; autonomous systems; control theory; Lyapunov methods; MARKOV-PROCESSES; OPTIMIZATION; APPROXIMATIONS; STABILITY;
D O I
10.1109/TAC.2020.3035755
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partially observable Markov decision processes (POMDPs) provide a modeling framework for a variety of sequential decision making under uncertainty scenarios in artificial intelligence (AI). Since the states are not directly observable in a POMDP, decision making has to be performed based on the output of a Bayesian filter (continuous beliefs); hence, making POMDPs intractable to solve and analyze. To overcome the complexity challenge of POMDPs, we apply techniques from the control theory. Our contributions are fourfold. 1) We begin by casting the problem of analyzing a POMDP into analyzing the behavior of a discrete-time switched system. 2) Then, in order to estimate the reachable belief space of a POMDP, i.e., the set of all possible evolutions given an initial belief distribution over the states and a set of actions and observations, we find overapproximations in terms of sublevel sets of Lyapunov-like functions. 3) Furthermore, in order to verify safety and performance requirements of a given POMDP, we formulate a barrier certificate theorem, wherein we show that if there exists a barrier certificate satisfying a set of inequalities along the solutions to the belief update equation of the POMDP, the safety and performance properties are guaranteed to hold. In both cases 2) and 3), the calculations can be decomposed and solved in parallel. 4) Finally, we show that the conditions we formulate can be computationally implemented as a set of sum-of-squares programs. We illustrate the applicability of our method by addressing two problems in active ad scheduling and machine teaching.
引用
收藏
页码:5191 / 5204
页数:14
相关论文
共 50 条
  • [1] A Hybrid Approach Combining Control Theory and AI for Engineering Self-Adaptive Systems
    Caldas, Ricardo Diniz
    Rodrigues, Arthur
    Gil, Eric Bernd
    Rodrigues, Genaina Nunes
    Vogel, Thomas
    Pelliccione, Patrizio
    2020 IEEE/ACM 15TH INTERNATIONAL SYMPOSIUM ON SOFTWARE ENGINEERING FOR ADAPTIVE AND SELF-MANAGING SYSTEMS, SEAMS, 2020, : 9 - 19
  • [2] Control theory meets synthetic biology
    Del Vecchio, Domitilla
    Dy, Aaron J.
    Qian, Yili
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2016, 13 (120)
  • [3] Steering of Discrete Event Systems: Control Theory Approach
    Easwaran, Arvind
    Kannan, Sampath
    Sokolsky, Oleg
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2006, 144 (04) : 21 - 39
  • [4] Monte Carlo Planning in Hybrid Belief POMDPs
    Barenboim, Moran
    Shienman, Moshe
    Indelman, Vadim
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (08) : 4410 - 4417
  • [5] The minimum principle of hybrid optimal control theory
    Pakniyat, Ali
    Caines, Peter E.
    MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2024, 36 (01) : 21 - 70
  • [6] Decentralized Coordination of Multi-Agent Systems Based on POMDPs and Consensus for Active Perception
    Peti, Marijana
    Petric, Frano
    Bogdan, Stjepan
    IEEE ACCESS, 2023, 11 : 52480 - 52491
  • [7] Hysteresis-Based Current Control Approach by Means of the Theory of Switched Systems
    Thielmann, Malte
    Hans, Florian
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [8] Observer Design for Linear Aperiodic Sampled-Data Systems: A Hybrid Systems Approach
    Ferrante, Francesco
    Seuret, Alexandre
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 470 - 475
  • [9] Neuroadaptive Control of Nonholonomic Systems With Function Constraints: Theory and Experiment
    Gao, Yang
    Zhang, Zhongcai
    Jiang, Nan
    Wu, Yuqiang
    IET CONTROL THEORY AND APPLICATIONS, 2025, 19 (01)
  • [10] An Efficient Hybrid Approach for Volt/Var Control in Distribution Systems
    Mohapatra, Abheejeet
    Bijwe, Pradeep R.
    Panigrahi, Bijaya K.
    IEEE TRANSACTIONS ON POWER DELIVERY, 2014, 29 (04) : 1780 - 1788