Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

被引:0
作者
Ma, Hang [1 ]
Zhang, Yuhang [1 ]
Li, Ziyang [1 ]
Zhang, Jiaqi [1 ]
Wu, Xibao [1 ]
Chen, Wenbai [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China
关键词
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE;
D O I
10.1016/j.compeleceng.2024.109605
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Addressing the issues of existing 3C assembly methods that rely on precise contact state models, low sampling efficiency, and poor safety, this paper proposes a research method for a manipulator-based 3C assembly strategy utilizing deep reinforcement learning. Initially, the study constructs a simulation task for 3C assembly involving a UR manipulator and flexible printed circuits (FPC) buckling within the MuJoCo development environment to mirror real-world assembly conditions. By incorporating a Gaussian distribution-based policy network suitable for continuous action spaces and employing the maximum entropy method to enhance the algorithm's exploratory capabilities, this study develops an efficient method for training autonomous assembly behavior strategies. We have successfully established a 3C assembly simulation environment that accurately simulates key physical parameters such as position, contact force, and torque, modeling the assembly task as a Markov decision process. Considering the semi-flexible nature of FPC, we control the magnitude of adaptive contact force to achieve compliant assembly of FPCs. Comprehensive simulation experiments demonstrate that the SAC algorithm proposed in this study enables the robot to autonomously and obediently complete the 3C assembly tasks, exhibiting good accuracy and stability. The assembly success rate reaches 93 %, and after training with the reinforcement learning strategy, the contact force meets the preset range, achieving the effect of compliant assembly.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] An obstacles avoidance method for serial manipulator based on reinforcement learning and Artificial Potential Field
    Haoxuan Li
    Daoxiong Gong
    Jianjun Yu
    International Journal of Intelligent Robotics and Applications, 2021, 5 : 186 - 202
  • [32] Acceleration control strategy for aero-engines based on model-free deep reinforcement learning method
    Gao, Wenbo
    Zhou, Xin
    Pan, Muxuan
    Zhou, Wenxiang
    Lu, Feng
    Huang, Jinquan
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 120
  • [33] Deep Reinforcement Learning Based Pricing Strategy of Aggregators Considering Renewable Energy
    Chuang, Yu-Chieh
    Chiu, Wei-Yu
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03): : 499 - 508
  • [34] Regenerative braking strategy based on deep reinforcement learning for an electric mining truck
    Yang W.
    Luo D.
    Zhang W.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2024, 46 (03): : 503 - 513
  • [35] Deep Reinforcement Learning based Multi-user Anti jamming Strategy
    Bi, Yue
    Wu, Yue
    Hua, Cunqing
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [36] Research on Control of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Li, Baoan
    Ship Building of China, 2020, 61 : 14 - 20
  • [37] A Survey of Robot Manipulation Behavior Research Based on Deep Reinforcement Learning
    Chen J.
    Zheng M.
    Jiqiren/Robot, 2022, 44 (02): : 236 - 256
  • [38] An Improved Deep Reinforcement Learning Method for Dispatch Optimization Strategy of Modern Power Systems
    Zhai, Suwei
    Li, Wenyun
    Qiu, Zhenyu
    Zhang, Xinyi
    Hou, Shixi
    ENTROPY, 2023, 25 (03)
  • [39] Reinforcement Learning Strategy Based on Multimodal Representations for High-Precision Assembly Tasks
    Li, Ajian
    Liu, Ruikai
    Yang, Xiansheng
    Lou, Yunjiang
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2021, PT I, 2021, 13013 : 56 - 66
  • [40] Reinforcement Learning Method Based Load shifting strategy with Demand Response
    Jin, Lingwu
    Chen, Zheng
    Li, Jinwei
    Ye, Tao
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 1586 - 1591