Learning visual-based deformable object rearrangement with local graph neural networks

被引:0
|
作者
Yuhong Deng
Xueqian Wang
Lipeng Chen
机构
[1] Tencent Robotics X Lab,The Center for Intelligent Control and Telescience
[2] Tsinghua Shenzhen International Graduate School,The Center for Intelligent Control and Telescience
[3] Tsinghua Shenzhen International Graduate School,undefined
来源
关键词
Deformable manipulation; Robot learning; Graph neural network; Sim-to-real learning;
D O I
暂无
中图分类号
学科分类号
摘要
Goal-conditioned rearrangement of deformable objects (e.g. straightening a rope and folding a cloth) is one of the most common deformable manipulation tasks, where the robot needs to rearrange a deformable object into a prescribed goal configuration with only visual observations. These tasks are typically confronted with two main challenges: the high dimensionality of deformable configuration space and the underlying complexity, nonlinearity and uncertainty inherent in deformable dynamics. To address these challenges, we propose a novel representation strategy that can efficiently model the deformable object states with a set of keypoints and their interactions. We further propose local-graph neural network (GNN), a light local GNN learning to jointly model the deformable rearrangement dynamics and infer the optimal manipulation actions (e.g. pick and place) by constructing and updating two dynamic graphs. Both simulated and real experiments have been conducted to demonstrate that the proposed dynamic graph representation shows superior expressiveness in modeling deformable rearrangement dynamics. Our method reaches much higher success rates on a variety of deformable rearrangement tasks (96.3% on average) than state-of-the-art method in simulation experiments. Besides, our method is much more lighter and has a 60% shorter inference time than state-of-the-art methods. We also demonstrate that our method performs well in the multi-task learning scenario and can be transferred to real-world applications with an average success rate of 95% by solely fine tuning a keypoint detector. A supplementary video can be found at https://youtu.be/AhwTQo6fCM0.
引用
收藏
页码:5923 / 5936
页数:13
相关论文
共 50 条
  • [21] Visual-based and object-conscious image retrieval by block reallocation into object region
    Mochizuki, Takahiro
    Kawai, Yoshihiko
    Sano, Masanori
    Sumiyoshi, Hideki
    Fujii, Mahito
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2016, 11 : S44 - S52
  • [22] Visual-Based Human Crowds Behavior Analysis Based on Graph Modeling and Matching
    Chen, Duan-Yu
    Huang, Po-Chung
    IEEE SENSORS JOURNAL, 2013, 13 (06) : 2129 - 2138
  • [23] Event-based Object Detection using Graph Neural Networks
    Sun, Daobo
    Ji, Haibo
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1895 - 1900
  • [24] Neural Networks Regularization With Graph-Based Local Resampling
    Assis, Alex D.
    Torres, Luiz C. B.
    Araujo, Lourencro R. G.
    Hanriot, Vitor M.
    Braga, Antonio P.
    IEEE ACCESS, 2021, 9 : 50727 - 50737
  • [25] Learning graph normalization for graph neural networks
    Chen, Yihao
    Tang, Xin
    Qi, Xianbiao
    Li, Chun-Guang
    Xiao, Rong
    NEUROCOMPUTING, 2022, 493 : 613 - 625
  • [26] Learning Graph Matching with Graph Neural Networks
    Dobler, Kalvin
    Riesen, Kaspar
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2024, 2024, 15154 : 3 - 12
  • [27] Local Augmentation for Graph Neural Networks
    Liu, Songtao
    Ying, Rex
    Dong, Hanze
    Li, Lanqing
    Xu, Tingyang
    Rong, Yu
    Zhao, Peilin
    Huang, Junzhou
    Wu, Dinghao
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [28] The neural basis of visual object learning
    Op de Beeck, Hans P.
    Baker, Chris I.
    TRENDS IN COGNITIVE SCIENCES, 2010, 14 (01) : 22 - 30
  • [29] Visual-based interface using hand gesture recognition and object tracking
    Chalechale, A.
    Safael, F.
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION B-ENGINEERING, 2008, 32 (B3): : 279 - 293
  • [30] Graph Based Visual Object Tracking
    Zhou Guanling
    Wang Yuping
    Dong Nanping
    2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL I, 2009, : 99 - 102