Probabilistic Specification Learning for Planning with Safety Constraints

被引：3

作者：

Watanabe, Kandai ^{[1
,2
]}

Renninger, Nicholas ^{[2
]}

Sankaranarayanan, Sriram ^{[1
,2
]}

Lahijanian, Morteza ^{[1
,2
]}

机构：

[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA

[2] Univ Colorado, Dept Aerosp Engn Sci, Boulder, CO 80309 USA

来源：

2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/IROS51168.2021.9636712

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a framework for learning task specifications from demonstrations, while ensuring that the learned specifications do not violate safety constraints. Furthermore, we show how these specifications can be used in a planning problem to control the robot under environments that can be different from those encountered during the learning phase. We formulate the specification learning problem as a grammatical inference problem, using probabilistic automata to represent specifications. The edge probabilities of the resulting automata represent the demonstrator's preferences. The main novelty in our approach is to incorporate the safety property during the learning process. We prove that the resulting automaton always respects a pre-specified safety property, and furthermore, the proposed method can easily be included in any Evidence-Driven State Merging (EDSM)-based automaton learning scheme. Finally, we introduce a planning algorithm that produces the most desirable plan by maximizing the probability of an accepting trace of the automaton. Case studies show that our algorithm learns the true probability distribution most accurately while maintaining safety. Since, specification is detached from the robot's environment model, a satisfying plan can be synthesized for a variety of different robots and environments including both mobile robots and manipulators.

引用

页码：6558 / 6565

页数：8

共 28 条

[1] Araki B., 2019, LEARNING PLAN LOGICA
[2] Baier C, 2008, PRINCIPLES OF MODEL CHECKING, P1
[3] Camacho A, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6065
[4] De Giacomo G, 2013, P 23 INT JOINT C ART, V13, P854
[5] De la Higuera C., 2010, GRAMMATICAL INFERENC
[6] Automated Abstraction of Manipulation Domains for Cost-Based Reactive Synthesis
He, Keliang
Lahijanian, Morteza
Kavraki, Lydia E.
Vardi, Moshe Y.
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02): : 285 - 292
[7] He KL, 2015, IEEE INT CONF ROBOT, P346, DOI 10.1109/ICRA.2015.7139022
[8] Imitation Learning: A Survey of Learning Methods
Hussein, Ahmed
Gaber, Mohamed Medhat
Elyan, Eyad
Jayne, Chrisina
[J]. ACM COMPUTING SURVEYS, 2017, 50 (02)
[9] TeLEx: Passive STL Learning Using Only Positive Examples
Jha, Susmit
Tiwari, Ashish
Seshia, Sanjit A.
Sahai, Tuhin
Shankar, Natarajan
[J]. RUNTIME VERIFICATION (RV 2017), 2017, 10548 : 208 - 224
[10] Kress-Gazit H., ANN REV CONTROL ROBO, V1, P211

← 1 2 3 →