Learning Rational Subgoals from Demonstrations and Instructions

被引：0

作者：

Luo, Zhezheng ^{[1
]}

Mao, Jiayuan ^{[1
]}

Wu, Jiajun ^{[2
]}

Lozano-Perez, Tomas ^{[1
]}

Tenenbaum, Joshua B. ^{[1
]}

Kaelbling, Leslie Pack ^{[1
]}

机构：

[1] MIT, Cambridge, MA 02139 USA

[2] Stanford Univ, Stanford, CA USA

来源：

THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a framework for learning useful subgoals that support efficient long-term planning to achieve novel goals. At the core of our framework is a collection of rational subgoals (RSGs), which are essentially binary classifiers over the environmental states. RSGs can be learned from weakly-annotated data, in the form of unsegmented demonstration trajectories, paired with abstract task descriptions, which are composed of terms initially unknown to the agent (e.g., collect-wood then craft-boat then go-across-river). Our framework also discovers dependencies between RSGs, e.g., the task collect-wood is a helpful subgoal for the task craft-boat. Given a goal description, the learned subgoals and the derived dependencies facilitate off-the-shelf planning algorithms, such as A* and RRT, by setting helpful subgoals as waypoints to the planner, which significantly improves performance-time efficiency. Project page: https://rsg.csail.mit.edu

引用

页码：12068 / 12078

页数：11

共 45 条

[1]

Andreas J., 2015, EMNLP, P1165

[2]

Andreas J, 2017, PR MACH LEARN RES, V70

[3]

[Anonymous], 2007, ICAPS

[4] A survey of robot learning from demonstration [J].