State Representation Learning for Task and Motion Planning in Robot Manipulation

Cited by: 0

Authors:

Qu Weiming [1]

Wei Yaoyao [1]

Luo Dingsheng [1]

Affiliation:

[1] Peking Univ, Sch Intelligence Sci & Technol, Natl Key Lab Gen Artificial Intelligence, Key Lab Machine Percept MoE, Beijing 100871, Peoples R China

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL | 2023

Keywords:

state representation learning; environment model; task and motion planning; robot manipulation

DOI:

10.1109/ICDL55364.2023.10364419

Chinese Library Classification (CLC):

B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology]

Discipline classification codes:

03; 0303; 030303; 04; 0402

Abstract:

Employing knowledge representation models designed by human experts has long been the dominant methodology in task and motion planning (TAMP). However, this approach is time-consuming and suffers from domain dependence. In this paper, we focus on TAMP for robot arm manipulation based on state representation learning. We present a state representation learning method and a joint learning strategy for both the state representation model and the environment model, enabling the robot to learn the environment model autonomously and thereby mitigating domain dependence. To improve planning efficiency and task success rate, we also incorporate a search pruning strategy based on value function learning and a re-planning method based on Model Predictive Control (MPC). The proposed method is evaluated in simulation and real-robot experiments and shown to be effective compared to current TAMP systems.
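The abstract names two planning ingredients, value-based search pruning and MPC-style re-planning, without giving details. The sketch below shows how these two ideas can fit together on a toy 1-D task. It is not the paper's implementation: the integer state space, the `model` and `value` functions (hand-written stand-ins for a learned environment model and a learned value function), the beam width `keep`, and the `make_slippery_env` disturbance are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

State = int
Action = int
ACTIONS: List[Action] = [-1, +1]   # hypothetical action set
GOAL: State = 5                    # hypothetical goal state


def model(state: State, action: Action) -> State:
    """Stand-in for a learned environment model (assumption)."""
    return state + action


def value(state: State) -> float:
    """Stand-in for a learned value function: 0 at the goal, lower elsewhere."""
    return -abs(GOAL - state)


def plan(state: State, horizon: int, keep: int = 1) -> List[Action]:
    """Search over the model, pruning all but the `keep` best-valued branches per depth."""
    beam: List[Tuple[State, List[Action]]] = [(state, [])]
    for _ in range(horizon):
        if any(s == GOAL for s, _ in beam):
            break
        candidates = [(model(s, a), seq + [a]) for s, seq in beam for a in ACTIONS]
        candidates.sort(key=lambda c: value(c[0]), reverse=True)
        beam = candidates[:keep]          # value-based pruning
    return max(beam, key=lambda c: value(c[0]))[1]


def run_mpc(start: State,
            step_env: Callable[[State, Action], State],
            horizon: int = 3,
            max_steps: int = 20) -> List[State]:
    """MPC-style loop: plan, execute only the first action, observe, re-plan."""
    state = start
    trajectory = [state]
    for _ in range(max_steps):
        if state == GOAL:
            break
        actions = plan(state, horizon)
        if not actions:
            break
        state = step_env(state, actions[0])   # the real outcome may differ from the model
        trajectory.append(state)
    return trajectory


def make_slippery_env() -> Callable[[State, Action], State]:
    """Toy 'real' environment that pushes the agent back once at state 2."""
    slipped = {"done": False}

    def step_env(state: State, action: Action) -> State:
        if state == 2 and not slipped["done"]:
            slipped["done"] = True
            return state - 1
        return state + action

    return step_env


if __name__ == "__main__":
    print(run_mpc(0, make_slippery_env()))
```

Because only the first planned action is executed before re-planning, the loop recovers from the single disturbance at state 2 instead of following a stale plan.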


Pages: 93-99

Page count: 7
