Learning-Based Bounded Synthesis for Semi-MDPs With LTL Specifications

Cited by: 0

Authors:

Oura, Ryohei [1]

Ushio, Toshimitsu [1]

Affiliations:

[1] Osaka Univ, Grad Sch Engn Sci, Toyonaka, Osaka 5608531, Japan

Source:

IEEE CONTROL SYSTEMS LETTERS | 2022 / Vol. 6

Funding:

Japan Science and Technology Agency;

Keywords:

Bounded synthesis; linear temporal logic; reinforcement learning; Bayesian inference; semi-Markov decision process;

DOI:

10.1109/LCSYS.2022.3169982

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This letter proposes a learning-based bounded synthesis method for a semi-Markov decision process (SMDP) with a linear temporal logic (LTL) specification. In the product of the SMDP and the deterministic K-co-Büchi automaton (dKcBA) converted from the LTL specification, we learn both the winning region for satisfying the LTL specification and the dynamics therein, using reinforcement learning and Bayesian inference. We then synthesize an optimal policy satisfying the following two conditions: (1) it maximizes the probability of reaching the winning region; (2) it minimizes a long-term risk for the dwell time within the winning region. The long-term risk is minimized using the estimated dynamics and value iteration. We show that, if the discount factor is sufficiently close to one, the synthesized policy converges to the optimal policy as the amount of data obtained by exploration goes to infinity.
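
As a rough illustration of the last two steps named in the abstract (estimating dynamics by Bayesian inference, then running value iteration on the estimate), the following is a minimal sketch and not the authors' algorithm. It assumes a plain finite MDP with a symmetric Dirichlet prior on each transition distribution and a generic discounted per-state cost standing in for the long-term risk; it omits the semi-Markov dwell times, the dKcBA product, and the learning of the winning region. All names (`posterior_mean`, `value_iteration`, `counts`, `risk`) and all numbers are hypothetical.

```python
def posterior_mean(counts, alpha=1.0):
    """Posterior-mean next-state probabilities under a symmetric Dirichlet(alpha) prior."""
    total = sum(counts) + alpha * len(counts)
    return [(c + alpha) / total for c in counts]


def q_values(P, cost, V, gamma):
    """Q[s][a] = cost[s] + gamma * expected value of the next state."""
    return [
        [cost[s] + gamma * sum(p * V[t] for t, p in enumerate(P[s][a]))
         for a in range(len(P[s]))]
        for s in range(len(P))
    ]


def value_iteration(P, cost, gamma=0.95, tol=1e-10, max_iter=100_000):
    """Minimize expected discounted cost; returns (values, greedy policy)."""
    V = [0.0] * len(P)
    for _ in range(max_iter):
        V_new = [min(q) for q in q_values(P, cost, V, gamma)]
        converged = max(abs(x - y) for x, y in zip(V, V_new)) < tol
        V = V_new
        if converged:
            break
    Q = q_values(P, cost, V, gamma)
    policy = [min(range(len(q)), key=q.__getitem__) for q in Q]
    return V, policy


# Hypothetical exploration data: counts[s][a][t] = observed transitions s --a--> t.
counts = [
    [[8, 0], [2, 6]],  # state 0: action 0 mostly stays, action 1 mostly leaves
    [[1, 7], [7, 1]],  # state 1: action 0 mostly stays, action 1 mostly leaves
]
risk = [0.0, 1.0]      # assumed per-step cost: state 1 is the "risky" state

# Estimated dynamics: posterior mean of each transition distribution.
P_hat = [[posterior_mean(c) for c in row] for row in counts]
V, policy = value_iteration(P_hat, risk)
print(policy)
```

With these made-up counts, the greedy policy keeps the system in the low-cost state 0 and steers it back there from state 1. As more transitions are observed, the posterior mean concentrates around the empirical frequencies, which is the intuition behind convergence claims of the kind stated in the abstract.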

Pages: 2557-2562

Page count: 6
