Probabilistic Specification Learning for Planning with Safety Constraints

被引:3
作者
Watanabe, Kandai [1 ,2 ]
Renninger, Nicholas [2 ]
Sankaranarayanan, Sriram [1 ,2 ]
Lahijanian, Morteza [1 ,2 ]
机构
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[2] Univ Colorado, Dept Aerosp Engn Sci, Boulder, CO 80309 USA
来源
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2021年
基金
美国国家科学基金会;
关键词
D O I
10.1109/IROS51168.2021.9636712
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a framework for learning task specifications from demonstrations, while ensuring that the learned specifications do not violate safety constraints. Furthermore, we show how these specifications can be used in a planning problem to control the robot under environments that can be different from those encountered during the learning phase. We formulate the specification learning problem as a grammatical inference problem, using probabilistic automata to represent specifications. The edge probabilities of the resulting automata represent the demonstrator's preferences. The main novelty in our approach is to incorporate the safety property during the learning process. We prove that the resulting automaton always respects a pre-specified safety property, and furthermore, the proposed method can easily be included in any Evidence-Driven State Merging (EDSM)-based automaton learning scheme. Finally, we introduce a planning algorithm that produces the most desirable plan by maximizing the probability of an accepting trace of the automaton. Case studies show that our algorithm learns the true probability distribution most accurately while maintaining safety. Since, specification is detached from the robot's environment model, a satisfying plan can be synthesized for a variety of different robots and environments including both mobile robots and manipulators.
引用
收藏
页码:6558 / 6565
页数:8
相关论文
共 28 条
  • [1] Araki B., 2019, LEARNING PLAN LOGICA
  • [2] Baier C, 2008, PRINCIPLES OF MODEL CHECKING, P1
  • [3] Camacho A, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P6065
  • [4] De Giacomo G, 2013, P 23 INT JOINT C ART, V13, P854
  • [5] De la Higuera C., 2010, GRAMMATICAL INFERENC
  • [6] Automated Abstraction of Manipulation Domains for Cost-Based Reactive Synthesis
    He, Keliang
    Lahijanian, Morteza
    Kavraki, Lydia E.
    Vardi, Moshe Y.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02): : 285 - 292
  • [7] He KL, 2015, IEEE INT CONF ROBOT, P346, DOI 10.1109/ICRA.2015.7139022
  • [8] Imitation Learning: A Survey of Learning Methods
    Hussein, Ahmed
    Gaber, Mohamed Medhat
    Elyan, Eyad
    Jayne, Chrisina
    [J]. ACM COMPUTING SURVEYS, 2017, 50 (02)
  • [9] TeLEx: Passive STL Learning Using Only Positive Examples
    Jha, Susmit
    Tiwari, Ashish
    Seshia, Sanjit A.
    Sahai, Tuhin
    Shankar, Natarajan
    [J]. RUNTIME VERIFICATION (RV 2017), 2017, 10548 : 208 - 224
  • [10] Kress-Gazit H., ANN REV CONTROL ROBO, V1, P211