Learning Task Specifications from Demonstrations

Cited by: 0
Authors
Vazquez-Chanlatte, Marcell [1]
Jha, Susmit [2]
Tiwari, Ashish [2]
Ho, Mark K. [1]
Seshia, Sanjit A. [1]
Affiliations
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] SRI Int, 333 Ravenswood Ave, Menlo Pk, CA 94025 USA
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018) | 2018, Vol. 31
Funding
U.S. National Science Foundation;
Keywords
MODEL;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Real-world applications often naturally decompose into several sub-tasks. In many settings (e.g., robotics) demonstrations provide a natural way to specify the sub-tasks. However, most methods for learning from demonstrations either do not provide guarantees that the artifacts learned for the sub-tasks can be safely recombined or limit the types of composition available. Motivated by this deficit, we consider the problem of inferring Boolean non-Markovian rewards (also known as logical trace properties or specifications) from demonstrations provided by an agent operating in an uncertain, stochastic environment. Crucially, specifications admit well-defined composition rules that are typically easy to interpret. In this paper, we formulate the specification inference task as a maximum a posteriori (MAP) probability inference problem, apply the principle of maximum entropy to derive an analytic demonstration likelihood model and give an efficient approach to search for the most likely specification in a large candidate pool of specifications. In our experiments, we demonstrate how learning specifications can help avoid common problems that often arise due to ad-hoc reward composition.
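The MAP formulation summarized in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state the likelihood formula, so the score used here (number of demonstrations times the KL divergence between the demonstrated and random-behavior satisfaction rates) is only one plausible maximum-entropy-style choice, and the names `log_likelihood`, `map_specification`, `CANDIDATES`, and `RANDOM_SAT_PROB`, as well as the probabilities and traces, are hypothetical.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence

Trace = Sequence[str]            # a demonstration, as a sequence of state labels
Spec = Callable[[Trace], bool]   # a Boolean (possibly non-Markovian) trace property


def bernoulli_kl(p: float, q: float) -> float:
    """KL divergence between Bernoulli(p) and Bernoulli(q), with 0*log(0) = 0."""
    def term(a: float, b: float) -> float:
        return 0.0 if a == 0.0 else a * math.log(a / b)
    return term(p, q) + term(1.0 - p, 1.0 - q)


def log_likelihood(spec: Spec, demos: List[Trace], random_sat_prob: float,
                   eps: float = 1e-9) -> float:
    """Assumed score: how much more often the demonstrations satisfy `spec`
    than random behavior would (random_sat_prob is a supplied, hypothetical input)."""
    q = min(max(random_sat_prob, eps), 1.0 - eps)   # keep the logs finite
    p_hat = sum(1 for d in demos if spec(d)) / len(demos)
    if p_hat < q:
        return 0.0  # assumption: treated as no evidence for the specification
    return len(demos) * bernoulli_kl(p_hat, q)


def map_specification(candidates: Dict[str, Spec],
                      random_sat_prob: Dict[str, float],
                      demos: List[Trace],
                      log_prior: Optional[Dict[str, float]] = None) -> str:
    """Return the candidate name maximizing log prior + log likelihood."""
    if log_prior is None:  # uniform prior over the candidate pool
        log_prior = {name: 0.0 for name in candidates}
    return max(
        candidates,
        key=lambda name: log_prior[name]
        + log_likelihood(candidates[name], demos, random_sat_prob[name]),
    )


# Hypothetical candidate pool over label traces.
CANDIDATES: Dict[str, Spec] = {
    "avoid_lava": lambda t: "lava" not in t,
    "reach_goal": lambda t: "goal" in t,
    "avoid_lava_and_reach_goal": lambda t: "lava" not in t and "goal" in t,
}
# Hypothetical probabilities that random behavior satisfies each candidate.
RANDOM_SAT_PROB = {
    "avoid_lava": 0.5,
    "reach_goal": 0.3,
    "avoid_lava_and_reach_goal": 0.1,
}
DEMOS: List[Trace] = [
    ["start", "path", "goal"],
    ["start", "path", "path", "goal"],
    ["start", "goal"],
]

best = map_specification(CANDIDATES, RANDOM_SAT_PROB, DEMOS)
print(best)
```

With these made-up inputs, every demonstration satisfies all three candidates, so the conjunction is selected: it is the property that random behavior would satisfy least often. The exhaustive `max` over the pool stands in for the efficient search mentioned in the abstract and is not that method.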
Pages: 11