Learning-Based Bounded Synthesis for Semi-MDPs With LTL Specifications

Cited by: 0
Authors
Oura, Ryohei [1]
Ushio, Toshimitsu [1]
Affiliation
[1] Osaka Univ, Grad Sch Engn Sci, Toyonaka, Osaka 5608531, Japan
Source
IEEE CONTROL SYSTEMS LETTERS | 2022 / Vol. 6
Funding
Japan Science and Technology Agency;
Keywords
Bounded synthesis; linear temporal logic; reinforcement learning; Bayesian inference; semi-Markov decision process;
DOI
10.1109/LCSYS.2022.3169982
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This letter proposes a learning-based bounded synthesis method for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic K-co-Büchi automaton (dKcBA) converted from the LTL specification, we learn both the winning region for satisfying the LTL specification and the dynamics therein, using reinforcement learning and Bayesian inference. Then, we synthesize an optimal policy satisfying the following two conditions. (1) It maximizes the probability of reaching the winning region. (2) It minimizes a long-term risk for the dwell time within the winning region. The long-term risk is minimized using the estimated dynamics and value iteration. We show that, if the discount factor is sufficiently close to one, the synthesized policy converges to the optimal policy as the amount of data obtained by exploration goes to infinity.
Pages: 2557 - 2562
Number of pages: 6
Related papers
50 in total
  • [21] Advances in Deep Learning-Based Program Synthesis
    Gou, Qian-Wen
    Dong, Yun-Wei
    Li, Yong-Min
    Jisuanji Xuebao/Chinese Journal of Computers, 2024, 47 (11) : 2594 - 2628
  • [22] Learning-Based Path Planning Under Co-Safe Temporal Logic Specifications
    Cho, Kyunghoon
    IEEE ACCESS, 2023, 11 : 25865 - 25878
  • [23] Learning-Based Risk-Bounded Path Planning Under Environmental Uncertainty
    Meng, Fei
    Chen, Liangliang
    Ma, Han
    Wang, Jiankun
    Meng, Max Q.-H.
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 4460 - 4470
  • [24] Semi-Supervised Learning-Based Image Denoising for Big Data
    Zhang, Kun
    Chen, Kai
    IEEE ACCESS, 2020, 8 : 172678 - 172691
  • [25] Semi-Supervised Learning-Based Method for Unknown Anomaly Detection
    Cheng, Yudong
    Zhou, Fang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (07) : 1670 - 1680
  • [26] Deep Learning-Based Path Planning Under Co-Safe Temporal Logic Specifications
    Lee, Kyoungho
    Cho, Kyunghoon
    IEEE ACCESS, 2024, 12 : 7704 - 7718
  • [27] A Deep Learning-Based Approach to Design Metasurfaces From Desired Far-Field Specifications
    Niu, Chen
    Phaneuf, Mario
    Qiu, Tianke
    Mojabi, Puyan
    IEEE OPEN JOURNAL OF ANTENNAS AND PROPAGATION, 2023, 4 : 641 - 653
  • [28] Learning-Based Template Synthesis for Groupwise Image Registration
    He, Ziyi
    Chung, Albert C. S.
    SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2021, 2021, 12965 : 55 - 66
  • [29] Learning-Based Sphere Nonlinear Interpolation for Motion Synthesis
    Xia, Guiyu
    Sun, Huaijiang
    Liu, Qingshan
    Hang, Renlong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (05) : 2927 - 2937
  • [30] Learning-Based View Synthesis for Light Field Cameras
    Kalantari, Nima Khademi
    Wang, Ting-Chun
    Ramamoorthi, Ravi
    ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):