Learning visual-based deformable object rearrangement with local graph neural networks

被引：0

作者：

Yuhong Deng

Xueqian Wang

Lipeng Chen

机构：

[1] Tencent Robotics X Lab,The Center for Intelligent Control and Telescience

[2] Tsinghua Shenzhen International Graduate School,The Center for Intelligent Control and Telescience

[3] Tsinghua Shenzhen International Graduate School,undefined

来源：

Complex & Intelligent Systems | 2023年 / 9卷

关键词：

Deformable manipulation; Robot learning; Graph neural network; Sim-to-real learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Goal-conditioned rearrangement of deformable objects (e.g. straightening a rope and folding a cloth) is one of the most common deformable manipulation tasks, where the robot needs to rearrange a deformable object into a prescribed goal configuration with only visual observations. These tasks are typically confronted with two main challenges: the high dimensionality of deformable configuration space and the underlying complexity, nonlinearity and uncertainty inherent in deformable dynamics. To address these challenges, we propose a novel representation strategy that can efficiently model the deformable object states with a set of keypoints and their interactions. We further propose local-graph neural network (GNN), a light local GNN learning to jointly model the deformable rearrangement dynamics and infer the optimal manipulation actions (e.g. pick and place) by constructing and updating two dynamic graphs. Both simulated and real experiments have been conducted to demonstrate that the proposed dynamic graph representation shows superior expressiveness in modeling deformable rearrangement dynamics. Our method reaches much higher success rates on a variety of deformable rearrangement tasks (96.3% on average) than state-of-the-art method in simulation experiments. Besides, our method is much more lighter and has a 60% shorter inference time than state-of-the-art methods. We also demonstrate that our method performs well in the multi-task learning scenario and can be transferred to real-world applications with an average success rate of 95% by solely fine tuning a keypoint detector. A supplementary video can be found at https://youtu.be/AhwTQo6fCM0.

引用

页码：5923 / 5936

页数：13

共 50 条

[1] Learning visual-based deformable object rearrangement with local graph neural networks
Deng, Yuhong
Wang, Xueqian
Chen, Lipeng
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (05) : 5923 - 5936
[2] Adversarial Object Rearrangement in Constrained Environments with Heterogeneous Graph Neural Networks
Lou, Xibai
Yu, Houjian
Worobel, Ross
Yang, Yang
Choi, Changhyun
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 1008 - 1015
[3] Salient Object Detection Based on Multiple Graph Neural Networks Collaborative Learning
Liu, Bing
Wang, Tiantian
Gao, Lina
Xu, Mingzhu
Fu, Ping
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (07) : 2561 - 2570
[4] Graph neural networks-based preference learning method for object ranking
Meng, Zhenhua
Lin, Rongheng
Wu, Budan
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2024, 167
[5] Learning Latent Object-Centric Representations for Visual-Based Robot Manipulation
Wang, Yunan
Wang, Jiayu
Li, Yixiao
Hu, Chuxiong
Zhu, Yu
2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 138 - 143
[6] 3D Object retrieval based on non-local graph neural networks
Yin-min Li
Zan Gao
Ya-bin Tao
Li-li Wang
Yan-bing Xue
Multimedia Tools and Applications, 2020, 79 : 34011 - 34027
[7] 3D Object retrieval based on non-local graph neural networks
Li, Yin-min
Gao, Zan
Tao, Ya-bin
Wang, Li-li
Xue, Yan-bing
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (45-46) : 34011 - 34027
[8] Deformable and Occluded Object Tracking via Graph Learning
Han, Wei
Huang, Guang-Bin
Cui, Dongshun
2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 376 - 383
[9] LEARNING OBJECT-CENTERED AUTOTELIC BEHAVIORS WITH GRAPH NEURAL NETWORKS
Akakzia, Ahmed
Sigaud, Olivier
CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
[10] Learning Human-Object Interactions by Graph Parsing Neural Networks
Qi, Siyuan
Wang, Wenguan
Jia, Baoxiong
Shen, Jianbing
Zhu, Song-Chun
COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423

← 1 2 3 4 5 →