Affordance-based robot object retrieval

被引:4
|
作者
Thao Nguyen [1 ]
Gopalan, Nakul [1 ,4 ]
Patel, Roma [1 ]
Corsaro, Matt [2 ]
Pavlick, Ellie [3 ]
Tellex, Stefanie [3 ]
机构
[1] Brown Univ, Providence, RI 02912 USA
[2] Brown Univ, Comp Sci, George Konidariss Intelligent Robot Lab, Providence, RI 02912 USA
[3] Brown Univ, Comp Sci, Providence, RI 02912 USA
[4] Georgia Inst Technol, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
Robots;
D O I
10.1007/s10514-021-10008-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language object retrieval is a highly useful yet challenging task for robots in human-centric environments. Previous work has primarily focused on commands specifying the desired object's type such as "scissors" and/or visual attributes such as "red," thus limiting the robot to only known object classes. We develop a model to retrieve objects based on descriptions of their usage. The model takes in a language command containing a verb, for example "Hand me something to cut," and RGB images of candidate objects; and outputs the object that best satisfies the task specified by the verb. Our model directly predicts an object's appearance from the object's use specified by a verb phrase, without needing an object's class label. Based on contextual information present in the language commands, our model can generalize to unseen object classes and unknown nouns in the commands. Our model correctly selects objects out of sets of five candidates to fulfill natural language commands, and achieves a mean reciprocal rank of 77.4% on a held-out test set of unseen ImageNet object classes and 69.1% on unseen object classes and unknown nouns. Our model also achieves a mean reciprocal rank of 71.8% on unseen YCB object classes, which have a different image distribution from ImageNet. We demonstrate our model on a KUKA LBR iiwa robot arm, enabling the robot to retrieve objects based on natural language descriptions of their usage (Video recordings of the robot demonstrations can be found at ). We also present a new dataset of 655 verb-object pairs denoting object usage over 50 verbs and 216 object classes (The dataset and code for the project can be found at https://github.com/Thaonguyen3095/affordance- language).
引用
收藏
页码:83 / 98
页数:16
相关论文
共 50 条
  • [1] TARS: Tactile Affordance in Robot Synesthesia for Dexterous Manipulation
    Wu, Qiwei
    Wang, Haidong
    Zhou, Jiayu
    Xiong, Xiaogang
    Lou, Yunjiang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (01): : 327 - 334
  • [2] Object Pose Estimation From RGB-D Images With Affordance-Instance Segmentation Constraint for Semantic Robot Manipulation
    Wang, Zhongli
    Tian, Guohui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 595 - 602
  • [3] Affordance Matching From The Shared Information In Multi-robot
    Yi, Chang'an
    Min, Huaqing
    Luo, Ronghua
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 66 - 71
  • [4] Research on Object Model-Based Architecture for Service Robot System
    邵鹏鸣
    李成刚
    吴翰声
    Journal of Southwest Jiaotong University, 2002, (01) : 21 - 32
  • [5] Proactive Robot Assistance: Affordance-Aware Augmented Reality User Interfaces
    Quesada, Rodrigo Chacon
    Demiris, Yiannis
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2022, 29 (01) : 22 - 34
  • [6] Visuo-Tactile Feedback-Based Robot Manipulation for Object Packing
    Liang, Wenyu
    Fang, Fen
    Acar, Cihan
    Toh, Wei Qi
    Sun, Ying
    Xu, Qianli
    Wu, Yan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 1151 - 1158
  • [7] Human-to-Robot Handover Control of an Autonomous Mobile Robot Based on Hand-Masked Object Pose Estimation
    Huang, Yu-Yun
    Song, Kai-Tai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7851 - 7858
  • [8] Task-Oriented Robot Cognitive Manipulation Planning Using Affordance Segmentation and Logic Reasoning
    Wang, Zhongli
    Tian, Guohui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12172 - 12185
  • [9] DETERMINING ROBOT LOCATION BY CYLINDRICAL OBJECT SHAPES
    LEE, JD
    CHEN, CH
    LEE, JY
    YOU, YC
    ELECTRONICS LETTERS, 1992, 28 (11) : 1044 - 1045
  • [10] Design and Trajectory Optimization of an Object Loading Robot
    Zheng, Yu
    Guang, Chenhan
    Li, Qian
    Yang, Yang
    Binggong Xuebao/Acta Armamentarii, 2020, 41 (04): : 763 - 770