Learning-Based Bounded Synthesis for Semi-MDPs With LTL Specifications

Cited by: 0
Authors
Oura, Ryohei [1]
Ushio, Toshimitsu [1]
Affiliations
[1] Osaka Univ, Grad Sch Engn Sci, Toyonaka, Osaka 5608531, Japan
Source
IEEE CONTROL SYSTEMS LETTERS | 2022 / Vol. 6
Funding
Japan Science and Technology Agency;
Keywords
Bounded synthesis; linear temporal logic; reinforcement learning; Bayesian inference; semi-Markov decision process;
DOI
10.1109/LCSYS.2022.3169982
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic K-co-Büchi automaton (dKcBA) converted from the LTL specification, we learn both the winning region for satisfying the LTL specification and the dynamics therein, based on reinforcement learning and Bayesian inference. Then, we synthesize an optimal policy satisfying the following two conditions. (1) It maximizes the probability of reaching the winning region. (2) It minimizes a long-term risk for the dwell time within the winning region. The minimization of the long-term risk is based on the estimated dynamics and value iteration. We show that, if the discount factor is sufficiently close to one, the synthesized policy converges to the optimal policy as the amount of data obtained through exploration goes to infinity.
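The pairing of Bayesian estimation of the dynamics with value iteration mentioned in the abstract can be illustrated with a much simplified sketch. This is not the authors' algorithm: it assumes a small finite MDP with no sojourn times, no dKcBA product and no winning-region learning, uses a Dirichlet-categorical posterior mean as the estimated transition model, and minimizes an ordinary expected discounted cost rather than the paper's risk measure. All names (`posterior_mean_transitions`, `value_iteration`, `counts`, `cost`) and all numbers are hypothetical.

```python
import numpy as np

def posterior_mean_transitions(counts, prior=1.0):
    """Posterior mean of a Dirichlet-categorical model per (state, action).

    counts[s, a, s2] is the number of observed transitions s --a--> s2;
    `prior` is an assumed symmetric Dirichlet pseudo-count.
    """
    alpha = counts + prior
    return alpha / alpha.sum(axis=2, keepdims=True)

def value_iteration(P, cost, gamma=0.9, tol=1e-10, max_iter=100_000):
    """Minimize expected discounted cost on the estimated model P[s, a, s2]."""
    V = np.zeros(P.shape[0])
    for _ in range(max_iter):
        Q = cost + gamma * (P @ V)      # Q has shape (states, actions)
        V_new = Q.min(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    Q = cost + gamma * (P @ V)
    return V, Q.argmin(axis=1)          # values and a greedy policy

# Toy data (assumed): 2 states, 2 actions.
counts = np.array([[[8.0, 2.0], [1.0, 9.0]],
                   [[5.0, 5.0], [0.0, 10.0]]])
cost = np.array([[1.0, 2.0],
                 [0.0, 0.5]])

P_hat = posterior_mean_transitions(counts)
V, policy = value_iteration(P_hat, cost, gamma=0.9)
```

As more transitions are observed, the pseudo-count prior is outweighed by the data and `P_hat` approaches the empirical frequencies, which is the intuition behind convergence statements of the kind made in the abstract.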
Pages: 2557-2562
Page count: 6