Learning Task Specifications from Demonstrations

被引:0
|
作者
Vazquez-Chanlatte, Marcell [1 ]
Jha, Susmit [2 ]
Tiwari, Ashish [2 ]
Ho, Mark K. [1 ]
Seshia, Sanjit A. [1 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018年 / 31卷
基金
美国国家科学基金会;
关键词
MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-world applications often naturally decompose into several sub-tasks. In many settings (e.g., robotics) demonstrations provide a natural way to specify the sub-tasks. However, most methods for learning from demonstrations either do not provide guarantees that the artifacts learned for the sub-tasks can be safely recombined or limit the types of composition available. Motivated by this deficit, we consider the problem of inferring Boolean non-Markovian rewards (also known as logical trace properties or specifications) from demonstrations provided by an agent operating in an uncertain, stochastic environment. Crucially, specifications admit well-defined composition rules that are typically easy to interpret. In this paper, we formulate the specification inference task as a maximum a posteriori (MAP) probability inference problem, apply the principle of maximum entropy to derive an analytic demonstration likelihood model and give an efficient approach to search for the most likely specification in a large candidate pool of specifications. In our experiments, we demonstrate how learning specifications can help avoid common problems that often arise due to ad-hoc reward composition.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Learning Temporal Task Specifications From Demonstrations
    Baert, Mattijs
    Leroux, Sam
    Simoens, Pieter
    EXPLAINABLE AND TRANSPARENT AI AND MULTI-AGENT SYSTEMS, EXTRAAMAS 2024, 2024, 14847 : 81 - 98
  • [2] Bayesian Inference of Temporal Task Specifications from Demonstrations
    Shah, Ankit
    Kamath, Pritish
    Li, Shen
    Shah, Julie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Using Causal Analysis to Learn Specifications from Task Demonstrations
    Angelov, Daniel
    Hristov, Yordan
    Ramamoorthy, Subramanian
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1341 - 1349
  • [4] Learning Task Priorities From Demonstrations
    Silverio, Joao
    Calinon, Sylvain
    Rozo, Leonel
    Caldwell, Darwin G.
    IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 78 - 94
  • [5] From demonstrations to task-space specifications. Using causal analysis to extract rule parameterization from demonstrations
    Angelov, Daniel
    Hristov, Yordan
    Ramamoorthy, Subramanian
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (02)
  • [6] From demonstrations to task-space specifications. Using causal analysis to extract rule parameterization from demonstrations
    Daniel Angelov
    Yordan Hristov
    Subramanian Ramamoorthy
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [7] Learning from Demonstrations with Partially Observable Task Parameters
    Alizadeh, Tohid
    Calinon, Sylvain
    Caldwell, Darwin G.
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 3309 - 3314
  • [8] Efficient Inference of Temporal Task Specifications from Human Demonstrations using Experiment Design
    Sobti, Shlok
    Shome, Rahul
    Kavraki, Lydia E.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9764 - 9770
  • [9] Learning Task-Parameterized Skills From Few Demonstrations
    Zhu, Jihong
    Gienger, Michael
    Kober, Jens
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 4063 - 4070
  • [10] Learning Temporal Task Models from Human Bimanual Demonstrations
    Dreher, Christian R. G.
    Asfour, Tam
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 7664 - 7671