Learning Neuro-symbolic Programs for Language Guided Robot Manipulation

被引：4

作者：

Namasivayam, K. ^{[1
]}

Singh, Himanshu ^{[1
]}

Bindal, Vishal ^{[1
]}

Tuli, Arnav ^{[1
]}

Agrawal, Vishwajeet ^{[1
]}

Jain, Rahul ^{[1
]}

Singla, Parag ^{[1
]}

Paul, Rohan ^{[1
]}

机构：

[1] IIT Delhi, New Delhi, India

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023年

关键词：

SKILLS;

D O I：

10.1109/ICRA48891.2023.10160545

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Given a natural language instruction and an input scene, our goal is to train a model to output a manipulation program that can be executed by the robot. Prior approaches for this task possess one of the following limitations: (i) rely on hand-coded symbols for concepts limiting generalization beyond those seen during training [1] (ii) infer action sequences from instructions but require dense sub-goal supervision [2] or (iii) lack semantics required for deeper object-centric reasoning inherent in interpreting complex instructions [3]. In contrast, our approach can handle linguistic as well as perceptual variations, end-to-end trainable and requires no intermediate supervision. The proposed model uses symbolic reasoning constructs that operate on a latent neural object-centric representation, allowing for deeper reasoning over the input scene. Central to our approach is a modular structure consisting of a hierarchical instruction parser and an action simulator to learn disentangled action representations. Our experiments on a simulated environment with a 7-DOF manipulator, consisting of instructions with varying number of steps and scenes with different number of objects, demonstrate that our model is robust to such variations and significantly outperforms baselines, particularly in the generalization settings. The code, dataset and experiment videos are available at https://nsrmp.github.io

引用

页码：7973 / 7980

页数：8

共 8 条

[1] Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning
Chitnis, Rohan
Silver, Tom
Tenenbaum, Joshua B.
Lozano-Perez, Tomas
Kaelbling, Leslie Pack
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 4166 - 4173
[2] A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
Kroemer, Oliver
Niekum, Scott
Konidaris, George
JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
[3] A robot learning from demonstration framework to perform force-based manipulation tasks
Rozo, Leonel
Jimenez, Pablo
Torras, Carme
INTELLIGENT SERVICE ROBOTICS, 2013, 6 (01) : 33 - 51
[4] Autonomous Tissue Manipulation via Surgical Robot Using Deep Reinforcement Learning and Evolutionary Algorithm
Shahkoo, Amin Abbasi
Abin, Ahmad Ali
IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2023, 5 (01): : 30 - 41
[5] PRF: A Program Reuse Framework for Automated Programming by Learning from Existing Robot Programs
Toner, Tyler
Tilbury, Dawn M.
Barton, Kira
ROBOTICS, 2024, 13 (08)
[6] Social robots in a translanguaging pedagogy: A review to identify opportunities for robot-assisted (language) learning
van den Berghe, Rianne
FRONTIERS IN ROBOTICS AND AI, 2022, 9
[7] Symbolic Play and Novel Noun Learning in Deaf and Hearing Children: Longitudinal Effects of Access to Sound on Early Precursors of Language
Quittner, Alexandra L.
Cejas, Ivette
Wang, Nae-Yuh
Niparko, John K.
Barker, David H.
PLOS ONE, 2016, 11 (05):
[8] Comparison of success rates, learning curves, and inter-subject performance variability of robot-assisted and manual ultrasound-guided nerve block needle guidance in simulation
Morse, J.
Terrasini, N.
Wehbe, M.
Philippona, C.
Zaouter, C.
Cyr, S.
Hemmerling, T. M.
BRITISH JOURNAL OF ANAESTHESIA, 2014, 112 (06) : 1092 - 1097

← 1 →