Using Goal-Conditioned Reinforcement Learning With Deep Imitation to Control Robot Arm in Flexible Flat Cable Assembly Task

被引：5

作者：

Li, Jingchen ^{[1
]}

Shi, Haobin ^{[1
]}

Hwang, Kao-Shing ^{[2
,3
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China

[2] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 81164, Taiwan

[3] Kaohsiung Med Univ, Dept Healthcare Adm & Med Informat, Kaohsiung 80708, Taiwan

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Robots; Manipulators; Reinforcement learning; Task analysis; Connectors; Service robots; Production; Deep reinforcement learning; robot arm; intelligent assembly;

D O I：

10.1109/TASE.2023.3323307

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Leveraging reinforcement learning on high-precision decision-making in Robot Arm assembly scenes is a desired goal in the industrial community. However, tasks like Flexible Flat Cable (FFC) assembly, which require highly trained workers, pose significant challenges due to sparse rewards and limited learning conditions. In this work, we propose a goal-conditioned self-imitation reinforcement learning method for FFC assembly without relying on a specific end-effector, where both perception and behavior plannings are learned through reinforcement learning. We analyze the challenges faced by Robot Arm in high-precision assembly scenarios and balance the breadth and depth of exploration during training. Our end-to-end model consists of hindsight and self-imitation modules, allowing the Robot Arm to leverage futile exploration and optimize successful trajectories. Our method does not require rule-based or manual rewards, and it enables the Robot Arm to quickly find feasible solutions through experience relabeling, while unnecessary explorations are avoided. We train the FFC assembly policy in a simulation environment and transfer it to the real scenario by using domain adaptation. We explore various combinations of hindsight and self-imitation learning, and discuss the results comprehensively. Experimental findings demonstrate that our model achieves fast and advanced flexible flat cable assembly, surpassing other reinforcement learning-based methods.Note to Practitioners-The motivation of this article stems from the need to develop an efficient and accurate FFC assembly policy for 3C (Computer, Communication, and Consumer Electronic) industry, promoting the development of intelligent manufacturing. Traditional control methods are incompetent to complete such a high-precision task with Robot Arm due to the difficult-to-model connectors, and existing reinforcement learning methods cannot converge with restricted epochs because of the difficult goals or trajectories. To quickly learn a high-quality assembly for Robot Arm and accelerate the convergence speed, we combine the goal-conditioned reinforcement learning and self-imitation mechanism, balancing the depth and breadth of exploration. The proposal takes visual information and six-dimensions force as state, obtaining satisfactory assembly policies. We build a simulation scene by the Pybullet platform and pre-train the Robot Arm on it, and then the pre-trained policies can be reused in real scenarios with finetuning.

引用

页码：6217 / 6228

页数：12

共 50 条

[21] Robot skill acquisition in assembly process using deep reinforcement learning
Li, Fengming
Jiang, Qi
Zhang, Sisi
Wei, Meng
Song, Rui
NEUROCOMPUTING, 2019, 345 : 92 - 102
[22] A Task-Adaptive Deep Reinforcement Learning Framework for Dual-Arm Robot Manipulation
Cui, Yuanzhe
Xu, Zhipeng
Zhong, Lou
Xu, Pengjie
Shen, Yichao
Tang, Qirong
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 466 - 479
[23] Deep Reinforcement Learning for Concentric Tube Robot Control with a Goal-Based Curriculum
Iyengar, Keshav
Stoyanov, Danail
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1459 - 1465
[24] Robot Arm Control Method of Moving Below Object Based on Deep Reinforcement Learning
Li, HeYu
Guo, LiQin
Shi, GuoQiang
Xiao, YingYing
Zeng, Bi
Lin, TingYu
Jia, ZhengXuan
METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2019, 1094 : 127 - 136
[25] Decision making on robot with multi-task using deep reinforcement learning for each task
Shimoguchi, Yuya
Kurashige, Kentarou
2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3460 - 3465
[26] Position Control of Cable-Driven Robotic Soft Arm Based on Deep Reinforcement Learning
Wu, Qiuxuan
Gu, Yueqin
Li, Yancheng
Zhang, Botao
Chepinskiy, Sergey A.
Wang, Jian
Zhilenkov, Anton A.
Krasnov, Aleksandr Y.
Chernyi, Sergei
INFORMATION, 2020, 11 (06)
[27] Continuous Control of a Soft Continuum Arm using Deep Reinforcement Learning
Satheeshbabu, Sreeshankar
Uppalapati, Naveen K.
Fu, Tianshi
Krishnan, Girish
2020 3RD IEEE INTERNATIONAL CONFERENCE ON SOFT ROBOTICS (ROBOSOFT), 2020, : 497 - 503
[28] Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control
Jha, Devesh K.
Jain, Siddarth
Romeres, Diego
Yerazunis, William
Nikovski, Daniel
2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
[29] Human-robot collaborative assembly task planning for mobile cobots based on deep reinforcement learning
Hou, Wenbin
Xiong, Zhihua
Yue, Ming
Chen, Hao
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2024, 238 (23) : 11097 - 11114
[30] Position control of a planar cable-driven parallel robot using reinforcement learning
Sancak, Caner
Yamac, Fatma
Itik, Mehmet
ROBOTICA, 2022, 40 (10) : 3378 - 3395

← 1 2 3 4 5 →