Learning visual-based deformable object rearrangement with local graph neural networks

Authors
Yuhong Deng
Xueqian Wang
Lipeng Chen
Affiliations
[1] Tencent Robotics X Lab, The Center for Intelligent Control and Telescience
[2] Tsinghua Shenzhen International Graduate School, The Center for Intelligent Control and Telescience
[3] Tsinghua Shenzhen International Graduate School
Source
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05)
Keywords
Deformable manipulation; Robot learning; Graph neural network; Sim-to-real learning
Abstract
Goal-conditioned rearrangement of deformable objects (e.g., straightening a rope or folding a cloth) is one of the most common deformable manipulation tasks, in which the robot must rearrange a deformable object into a prescribed goal configuration using only visual observations. These tasks face two main challenges: the high dimensionality of the deformable configuration space, and the complexity, nonlinearity and uncertainty inherent in deformable dynamics. To address these challenges, we propose a novel representation strategy that efficiently models deformable object states with a set of keypoints and their interactions. We further propose the local graph neural network (local GNN), a lightweight GNN that learns to jointly model the deformable rearrangement dynamics and infer the optimal manipulation actions (e.g., pick and place) by constructing and updating two dynamic graphs. Both simulated and real-world experiments demonstrate that the proposed dynamic graph representation is more expressive in modeling deformable rearrangement dynamics. In simulation, our method reaches much higher success rates on a variety of deformable rearrangement tasks (96.3% on average) than the state-of-the-art method. In addition, our method is much lighter and has a 60% shorter inference time than state-of-the-art methods. We also demonstrate that our method performs well in the multi-task learning scenario and can be transferred to real-world applications with an average success rate of 95% by fine-tuning only a keypoint detector. A supplementary video can be found at https://youtu.be/AhwTQo6fCM0.
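As a rough illustration of the keypoint-based representation the abstract describes, the following minimal sketch builds a local graph over detected keypoints by linking each keypoint to its nearest neighbours. The abstract does not specify how the two dynamic graphs are constructed, so the k-nearest-neighbour rule, the function name `build_local_graph`, and the sample rope coordinates are all assumptions made for illustration, not the authors' method.

```python
import math


def build_local_graph(keypoints, k=2):
    """Link each keypoint to its k nearest neighbours (hypothetical rule).

    keypoints: list of (x, y) tuples, e.g. detected along a rope.
    Returns a sorted list of undirected edges (i, j) with i < j.
    """
    edges = set()
    for i, p in enumerate(keypoints):
        # Rank all other keypoints by Euclidean distance to keypoint i.
        others = sorted(
            (j for j in range(len(keypoints)) if j != i),
            key=lambda j: math.dist(p, keypoints[j]),
        )
        # Keep only the k closest ones, so each node sees a local neighbourhood.
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return sorted(edges)


# Made-up keypoints for a nearly straight rope (illustrative values only).
rope = [(0.0, 0.0), (1.0, 0.1), (2.0, 0.0), (3.0, 0.2), (4.0, 0.0)]
edges = build_local_graph(rope, k=2)
print(edges)
```

Restricting edges to nearby keypoints keeps the graph sparse, which is one plausible reading of "local" in the abstract; a graph neural network would then pass messages only along these edges.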
Pages: 5923 - 5936
Page count: 13
Related papers
  • [1] Learning visual-based deformable object rearrangement with local graph neural networks
    Deng, Yuhong
    Wang, Xueqian
    Chen, Lipeng
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5923 - 5936
  • [2] Adversarial Object Rearrangement in Constrained Environments with Heterogeneous Graph Neural Networks
    Lou, Xibai
    Yu, Houjian
    Worobel, Ross
    Yang, Yang
    Choi, Changhyun
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1008 - 1015
  • [3] Salient Object Detection Based on Multiple Graph Neural Networks Collaborative Learning
    Liu, Bing
    Wang, Tiantian
    Gao, Lina
    Xu, Mingzhu
    Fu, Ping
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (07) : 2561 - 2570
  • [4] Graph neural networks-based preference learning method for object ranking
    Meng, Zhenhua
    Lin, Rongheng
    Wu, Budan
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 167
  • [5] Learning Latent Object-Centric Representations for Visual-Based Robot Manipulation
    Wang, Yunan
    Wang, Jiayu
    Li, Yixiao
    Hu, Chuxiong
    Zhu, Yu
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 138 - 143
  • [6] 3D Object retrieval based on non-local graph neural networks
    Li, Yin-min
    Gao, Zan
    Tao, Ya-bin
    Wang, Li-li
    Xue, Yan-bing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34011 - 34027
  • [7] Deformable and Occluded Object Tracking via Graph Learning
    Han, Wei
    Huang, Guang-Bin
    Cui, Dongshun
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 376 - 383
  • [8] LEARNING OBJECT-CENTERED AUTOTELIC BEHAVIORS WITH GRAPH NEURAL NETWORKS
    Akakzia, Ahmed
    Sigaud, Olivier
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [9] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423