Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

被引：0

作者：

Ma, Hang ^{[1
]}

Zhang, Yuhang ^{[1
]}

Li, Ziyang ^{[1
]}

Zhang, Jiaqi ^{[1
]}

Wu, Xibao ^{[1
]}

Chen, Wenbai ^{[1
]}

机构：

[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2024年 / 119卷

关键词：

3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE;

D O I：

10.1016/j.compeleceng.2024.109605

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Addressing the issues of existing 3C assembly methods that rely on precise contact state models, low sampling efficiency, and poor safety, this paper proposes a research method for a manipulator-based 3C assembly strategy utilizing deep reinforcement learning. Initially, the study constructs a simulation task for 3C assembly involving a UR manipulator and flexible printed circuits (FPC) buckling within the MuJoCo development environment to mirror real-world assembly conditions. By incorporating a Gaussian distribution-based policy network suitable for continuous action spaces and employing the maximum entropy method to enhance the algorithm's exploratory capabilities, this study develops an efficient method for training autonomous assembly behavior strategies. We have successfully established a 3C assembly simulation environment that accurately simulates key physical parameters such as position, contact force, and torque, modeling the assembly task as a Markov decision process. Considering the semi-flexible nature of FPC, we control the magnitude of adaptive contact force to achieve compliant assembly of FPCs. Comprehensive simulation experiments demonstrate that the SAC algorithm proposed in this study enables the robot to autonomously and obediently complete the 3C assembly tasks, exhibiting good accuracy and stability. The assembly success rate reaches 93 %, and after training with the reinforcement learning strategy, the contact force meets the preset range, achieving the effect of compliant assembly.

引用

页数：14

共 50 条

[21] Deep-Reinforcement-Learning-Based Intelligent Routing Strategy for FANETs
Lin, Deping
Peng, Tao
Zuo, Peiliang
Wang, Wenbo
SYMMETRY-BASEL, 2022, 14 (09):
[22] Research on epidemic tracking method based on reinforcement learning
Guo, Siyuan
Yan, Huaicheng
Li, Yue
Ke, Bai
Li Zhichen
2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1335 - 1340
[23] An Intersection Signal Control Method Based on Deep Reinforcement Learning
Pang Ha-li
Ding Ke
2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 344 - 348
[24] A UAV Path Planning Method Based on Deep Reinforcement Learning
Li, Yibing
Zhang, Sitong
Ye, Fang
Jiang, Tao
Li, Yingsong
2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
[25] An obstacles avoidance method for serial manipulator based on reinforcement learning and Artificial Potential Field
Haoxuan Li
Daoxiong Gong
Jianjun Yu
International Journal of Intelligent Robotics and Applications, 2021, 5 : 186 - 202
[26] An obstacles avoidance method for serial manipulator based on reinforcement learning and Artificial Potential Field
Li, Haoxuan
Gong, Daoxiong
Yu, Jianjun
INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2021, 5 (02) : 186 - 202
[27] Learning to Drive Like Human Beings: A Method Based on Deep Reinforcement Learning
Tian, Yantao
Cao, Xuanhao
Huang, Kai
Fei, Cong
Zheng, Zhu
Ji, Xuewu
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 6357 - 6367
[28] Research on a Task Offloading Strategy for the Internet of Vehicles Based on Reinforcement Learning
Xiao, Shuo
Wang, Shengzhi
Zhuang, Jiayu
Wang, Tianyu
Liu, Jiajia
SENSORS, 2021, 21 (18)
[29] Research of Control Strategy of Power System Stabilizer Based on Reinforcement Learning
Zhu, Xingyu
Jin, Tao
2020 IEEE 2ND INTERNATIONAL CONFERENCE ON CIRCUITS AND SYSTEMS (ICCS 2020), 2020, : 81 - 85
[30] Acceleration control strategy for aero-engines based on model-free deep reinforcement learning method
Gao, Wenbo
Zhou, Xin
Pan, Muxuan
Zhou, Wenxiang
Lu, Feng
Huang, Jinquan
AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 120

← 1 2 3 4 5 →