Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

Authors
Ma, Hang [1]
Zhang, Yuhang [1]
Li, Ziyang [1]
Zhang, Jiaqi [1]
Wu, Xibao [1]
Chen, Wenbai [1]
Affiliations
[1] Beijing Information Science and Technology University, School of Automation, Beijing 100101, People's Republic of China
Keywords
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE
DOI
10.1016/j.compeleceng.2024.109605
Chinese Library Classification
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
Existing 3C assembly methods rely on precise contact state models and suffer from low sampling efficiency and poor safety. To address these issues, this paper proposes a manipulator-based 3C assembly strategy that uses deep reinforcement learning. First, the study constructs a 3C assembly simulation task involving a UR manipulator and flexible printed circuit (FPC) buckling in the MuJoCo environment to mirror real-world assembly conditions. By incorporating a Gaussian policy network suited to continuous action spaces and employing the maximum entropy method to strengthen exploration, the study develops an efficient method for training autonomous assembly behavior strategies. The resulting simulation environment accurately simulates key physical quantities such as position, contact force, and torque, and the assembly task is modeled as a Markov decision process. Because FPCs are semi-flexible, the magnitude of the contact force is controlled adaptively to achieve compliant assembly. Comprehensive simulation experiments demonstrate that the soft actor-critic (SAC) algorithm proposed in this study enables the robot to complete the 3C assembly tasks autonomously and compliantly, with good accuracy and stability. The assembly success rate reaches 93%, and after training with the reinforcement learning strategy the contact force stays within the preset range, achieving compliant assembly.
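The abstract pairs a Gaussian policy for continuous actions with a maximum entropy objective. The sketch below illustrates that idea in the way standard SAC implementations usually do it, and is not taken from the paper: the tanh squashing, the function names, and the temperature `alpha` are all assumptions made for illustration. It samples an action from a diagonal Gaussian, squashes it into (-1, 1), and computes the log-probability that the entropy bonus is built from.

```python
import math
import random


def sample_squashed_gaussian(mean, log_std, rng):
    """Sample a tanh-squashed diagonal Gaussian action and its log-probability.

    Hypothetical helper: `mean` and `log_std` stand in for the outputs of a
    policy network, one entry per action dimension.
    """
    action, log_prob = [], 0.0
    for mu, ls in zip(mean, log_std):
        noise = rng.gauss(0.0, 1.0)
        pre_tanh = mu + math.exp(ls) * noise  # reparameterised Gaussian sample
        a = math.tanh(pre_tanh)               # bound the action to (-1, 1)
        # Log-density of the unsquashed Gaussian sample.
        log_prob += -0.5 * noise ** 2 - ls - 0.5 * math.log(2.0 * math.pi)
        # Change-of-variables correction for the tanh squashing.
        log_prob -= math.log(1.0 - a * a + 1e-6)
        action.append(a)
    return action, log_prob


def entropy_regularised_reward(reward, log_prob, alpha):
    """Maximum entropy objective term: reward plus alpha times -log pi(a|s)."""
    return reward - alpha * log_prob
```

A low-probability action (very negative `log_prob`) receives a larger bonus, which is how the entropy term encourages exploration; `alpha` sets how strongly.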
Pages: 14