State Representation Learning for Task and Motion Planning in Robot Manipulation

被引:0
作者
Qu Weiming [1 ]
Wei Yaoyao [1 ]
Luo Dingsheng [1 ]
机构
[1] Peking Univ, Sch Intelligence Sci & Technol, Natl Key Lab Gen Artificial Intelligence, Key Lab Machine Percept MoE, Beijing 100871, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL | 2023年
关键词
state representation learning; environment model; task and motion planning; robot manipulation;
D O I
10.1109/ICDL55364.2023.10364419
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Employing the knowledge representation model designed by human experts has long been the dominant methodology in task and motion planning (TAMP). However, this type of method is time-consuming and suffers from the domain-dependence problem. In this paper, we focus on TAMP of robot arm manipulation based on state representation learning. We present a state representation learning method and a joint learning strategy for both the state representation model and the environment model, enabling the robot to learn the environment model autonomously, thereby mitigating the issue of domain-dependence. To improve planning efficiency and task success rate, we also incorporate a search pruning strategy based on value function learning and a re-planning method based on Model Predictive Control (MPC). The proposed method is evaluated in the simulation and real-robot experiments and shown to be effective compared to current TAMP systems.
引用
收藏
页码:93 / 99
页数:7
相关论文
共 30 条
[1]  
Rusu AA, 2016, Arxiv, DOI [arXiv:1606.04671, 10.48550/arXiv.1606.04671, DOI 10.48550/ARXIV.1606.04671, DOI 10.43550/ARXIV:1606.04671]
[2]  
Canny J, 1987, ACM-MIT Press Doctoral Dissertation Award Series
[3]  
Casalino A, 2019, IEEE INT C INT ROBOT, P1510, DOI [10.1109/iros40897.2019.8967578, 10.1109/IROS40897.2019.8967578]
[4]   Long-Horizon Manipulation of Unknown Objects via Task and Motion Planning with Estimated Affordances [J].
Curtis, Aidan ;
Fang, Xiaolin ;
Kaelbling, Leslie Pack ;
Lozano-Perez, Tomas ;
Garrett, Caelan Reed .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, :1940-1946
[5]  
Dean TL, 1991, Planning and control
[6]   Learning to Ground Objects for Robot Task and Motion Planning [J].
Ding, Yan ;
Zhang, Xiaohan ;
Zhan, Xingyue ;
Zhang, Shiqi .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) :5536-5543
[7]  
EROL K, 1994, PROCEEDINGS OF THE TWELFTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P1123
[8]  
Gammell JD, 2014, IEEE INT C INT ROBOT, P2997, DOI 10.1109/IROS.2014.6942976
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
Höfer S, 2016, 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), P3893, DOI 10.1109/IROS.2016.7759573