Learning-Based Bounded Synthesis for Semi-MDPs With LTL Specifications

被引:0
|
作者
Oura, Ryohei [1 ]
Ushio, Toshimitsu [1 ]
机构
[1] Osaka Univ, Grad Sch Engn Sci, Toyonaka, Osaka 5608531, Japan
来源
IEEE CONTROL SYSTEMS LETTERS | 2022年 / 6卷
基金
日本科学技术振兴机构;
关键词
Bounded synthesis; linear temporal logic; reinforcement learning; Bayesian inference; semi-Markov decision process;
D O I
10.1109/LCSYS.2022.3169982
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This letter proposes a learning-based bounded synthesis for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic K-co-Buchi automaton (dKcBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Bayesian inference. Then, we synthesize an optimal policy satisfying the following two conditions. (1) It maximizes the probability of reaching the wining region. (2) It minimizes a long-term risk for the dwell time within the winning region. The minimization of the long-term risk is done based on the estimated dynamics and a value iteration. We show that, if the discount factor is sufficiently close to one, the synthesized policy converges to the optimal policy as the number of the data obtained by the exploration goes to the infinity.
引用
收藏
页码:2557 / 2562
页数:6
相关论文
共 50 条
  • [41] NeuralSound: Learning-based Modal Sound Synthesis with Acoustic Transfer
    Jin, Xutong
    Li, Sheng
    Wang, Guoping
    Manocha, Dinesh
    ACM TRANSACTIONS ON GRAPHICS, 2022, 41 (04):
  • [42] Machine Learning-Based View Synthesis in Fourier Lightfield Microscopy
    Rostan, Julen
    Incardona, Nicolo
    Sanchez-Ortiga, Emilio
    Martinez-Corral, Manuel
    Latorre-Carmona, Pedro
    SENSORS, 2022, 22 (09)
  • [43] A novel machine learning-based spatialized population synthesis framework
    Khachman, Mohamed
    Morency, Catherine
    Ciari, Francesco
    TRANSPORTATION, 2024,
  • [44] A DEEP LEARNING-BASED PRESSURE MATCHING APPROACH TO SOUNDFIELD SYNTHESIS
    Comanducci, Luca
    Antonacci, Fabio
    Sarti, Augusto
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [45] Deep learning-based general beam synthesis for atmospheric propagation
    Wang, Minghao
    Zhang, Dejun
    Liang, Wenke
    Guo, Wen
    OPTICS EXPRESS, 2024, 32 (17): : 29159 - 29173
  • [46] A Generic Semi-Supervised Deep Learning-Based Approach for Automated Surface Inspection
    Zheng, Xiaoqing
    Wang, Hongcheng
    Chen, Jie
    Kong, Yaguang
    Zheng, Song
    IEEE ACCESS, 2020, 8 : 114088 - 114099
  • [47] An adaptive semi-supervised deep learning-based framework for the detection of Android malware
    Wajahat, Ahsan
    He, Jingsha
    Zhu, Nafei
    Mahmood, Tariq
    Nazir, Ahsan
    Pathan, Muhammad Salman
    Qureshi, Sirajuddin
    Ullah, Faheem
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 5141 - 5157
  • [48] SPIDER: A Semi-Supervised Continual Learning-based Network Intrusion Detection System
    Amalapuram, Suresh Kumar
    Tamma, Bheemarjuna Reddy
    Channappayya, Sumohana S.
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 571 - 580
  • [49] Hypergraph Learning-Based Semi-Supervised Multi-View Spectral Clustering
    Yang, Geng
    Li, Qin
    Yun, Yu
    Lei, Yu
    You, Jane
    ELECTRONICS, 2023, 12 (19)
  • [50] SSL-SVD: Semi-supervised Learning-based Sparse Trust Recommendation
    Hu, Zhengdi
    Xu, Guangquan
    Zheng, Xi
    Liu, Jiang
    Li, Zhangbing
    Sheng, Quan Z.
    Lian, Wenjuan
    Xian, Hequn
    ACM TRANSACTIONS ON INTERNET TECHNOLOGY, 2020, 20 (01)