Learning Rational Subgoals from Demonstrations and Instructions

被引:0
作者
Luo, Zhezheng [1 ]
Mao, Jiayuan [1 ]
Wu, Jiajun [2 ]
Lozano-Perez, Tomas [1 ]
Tenenbaum, Joshua B. [1 ]
Kaelbling, Leslie Pack [1 ]
机构
[1] MIT, Cambridge, MA 02139 USA
[2] Stanford Univ, Stanford, CA USA
来源
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10 | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for learning useful subgoals that support efficient long-term planning to achieve novel goals. At the core of our framework is a collection of rational subgoals (RSGs), which are essentially binary classifiers over the environmental states. RSGs can be learned from weakly-annotated data, in the form of unsegmented demonstration trajectories, paired with abstract task descriptions, which are composed of terms initially unknown to the agent (e.g., collect-wood then craft-boat then go-across-river). Our framework also discovers dependencies between RSGs, e.g., the task collect-wood is a helpful subgoal for the task craft-boat. Given a goal description, the learned subgoals and the derived dependencies facilitate off-the-shelf planning algorithms, such as A* and RRT, by setting helpful subgoals as waypoints to the planner, which significantly improves performance-time efficiency. Project page: https://rsg.csail.mit.edu
引用
收藏
页码:12068 / 12078
页数:11
相关论文
共 45 条
[1]  
Andreas J., 2015, EMNLP, P1165
[2]  
Andreas J, 2017, PR MACH LEARN RES, V70
[3]  
[Anonymous], 2007, ICAPS
[4]   A survey of robot learning from demonstration [J].
Argall, Brenna D. ;
Chernova, Sonia ;
Veloso, Manuela ;
Browning, Brett .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (05) :469-483
[5]   Action understanding as inverse planning [J].
Baker, Chris L. ;
Saxe, Rebecca ;
Tenenbaum, Joshua B. .
COGNITION, 2009, 113 (03) :329-349
[6]   Recent Advances in Hierarchical Reinforcement Learning [J].
Andrew G. Barto ;
Sridhar Mahadevan .
Discrete Event Dynamic Systems, 2003, 13 (4) :341-379
[7]   Model-based hierarchical reinforcement learning and human action control [J].
Botvinick, Matthew ;
Weinstein, Ari .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2014, 369 (1655)
[8]  
Bradley C., 2021, arXiv
[9]  
Chen V., 2021, ICLR
[10]  
Chernova S., 2007, AAMAS