Learning Neuro-symbolic Programs for Language Guided Robot Manipulation

被引:4
作者
Namasivayam, K. [1 ]
Singh, Himanshu [1 ]
Bindal, Vishal [1 ]
Tuli, Arnav [1 ]
Agrawal, Vishwajeet [1 ]
Jain, Rahul [1 ]
Singla, Parag [1 ]
Paul, Rohan [1 ]
机构
[1] IIT Delhi, New Delhi, India
来源
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) | 2023年
关键词
SKILLS;
D O I
10.1109/ICRA48891.2023.10160545
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a natural language instruction and an input scene, our goal is to train a model to output a manipulation program that can be executed by the robot. Prior approaches for this task possess one of the following limitations: (i) rely on hand-coded symbols for concepts limiting generalization beyond those seen during training [1] (ii) infer action sequences from instructions but require dense sub-goal supervision [2] or (iii) lack semantics required for deeper object-centric reasoning inherent in interpreting complex instructions [3]. In contrast, our approach can handle linguistic as well as perceptual variations, end-to-end trainable and requires no intermediate supervision. The proposed model uses symbolic reasoning constructs that operate on a latent neural object-centric representation, allowing for deeper reasoning over the input scene. Central to our approach is a modular structure consisting of a hierarchical instruction parser and an action simulator to learn disentangled action representations. Our experiments on a simulated environment with a 7-DOF manipulator, consisting of instructions with varying number of steps and scenes with different number of objects, demonstrate that our model is robust to such variations and significantly outperforms baselines, particularly in the generalization settings. The code, dataset and experiment videos are available at https://nsrmp.github.io
引用
收藏
页码:7973 / 7980
页数:8
相关论文
共 8 条
  • [1] Learning Neuro-Symbolic Relational Transition Models for Bilevel Planning
    Chitnis, Rohan
    Silver, Tom
    Tenenbaum, Joshua B.
    Lozano-Perez, Tomas
    Kaelbling, Leslie Pack
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 4166 - 4173
  • [2] A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms
    Kroemer, Oliver
    Niekum, Scott
    Konidaris, George
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [3] A robot learning from demonstration framework to perform force-based manipulation tasks
    Rozo, Leonel
    Jimenez, Pablo
    Torras, Carme
    INTELLIGENT SERVICE ROBOTICS, 2013, 6 (01) : 33 - 51
  • [4] Autonomous Tissue Manipulation via Surgical Robot Using Deep Reinforcement Learning and Evolutionary Algorithm
    Shahkoo, Amin Abbasi
    Abin, Ahmad Ali
    IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2023, 5 (01): : 30 - 41
  • [5] PRF: A Program Reuse Framework for Automated Programming by Learning from Existing Robot Programs
    Toner, Tyler
    Tilbury, Dawn M.
    Barton, Kira
    ROBOTICS, 2024, 13 (08)
  • [6] Social robots in a translanguaging pedagogy: A review to identify opportunities for robot-assisted (language) learning
    van den Berghe, Rianne
    FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [7] Symbolic Play and Novel Noun Learning in Deaf and Hearing Children: Longitudinal Effects of Access to Sound on Early Precursors of Language
    Quittner, Alexandra L.
    Cejas, Ivette
    Wang, Nae-Yuh
    Niparko, John K.
    Barker, David H.
    PLOS ONE, 2016, 11 (05):
  • [8] Comparison of success rates, learning curves, and inter-subject performance variability of robot-assisted and manual ultrasound-guided nerve block needle guidance in simulation
    Morse, J.
    Terrasini, N.
    Wehbe, M.
    Philippona, C.
    Zaouter, C.
    Cyr, S.
    Hemmerling, T. M.
    BRITISH JOURNAL OF ANAESTHESIA, 2014, 112 (06) : 1092 - 1097