PROGPROMPT: program generation for situated robot task planning using large language models
被引:8
作者:
Singh, Ishika
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Singh, Ishika
[1
]
Blukis, Valts
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Blukis, Valts
[2
]
Mousavian, Arsalan
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Mousavian, Arsalan
[2
]
Goyal, Ankit
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Goyal, Ankit
[2
]
Xu, Danfei
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Xu, Danfei
[2
]
Tremblay, Jonathan
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Tremblay, Jonathan
[2
]
Fox, Dieter
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Fox, Dieter
[2
,3
]
Thomason, Jesse
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Thomason, Jesse
[1
]
Garg, Animesh
论文数: 0引用数: 0
h-index: 0
机构:
NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USAUniv Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
Garg, Animesh
[2
,4
]
机构:
[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io