Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

被引:0
|
作者
Ma, Hang [1 ]
Zhang, Yuhang [1 ]
Li, Ziyang [1 ]
Zhang, Jiaqi [1 ]
Wu, Xibao [1 ]
Chen, Wenbai [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China
关键词
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE;
D O I
10.1016/j.compeleceng.2024.109605
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Addressing the issues of existing 3C assembly methods that rely on precise contact state models, low sampling efficiency, and poor safety, this paper proposes a research method for a manipulator-based 3C assembly strategy utilizing deep reinforcement learning. Initially, the study constructs a simulation task for 3C assembly involving a UR manipulator and flexible printed circuits (FPC) buckling within the MuJoCo development environment to mirror real-world assembly conditions. By incorporating a Gaussian distribution-based policy network suitable for continuous action spaces and employing the maximum entropy method to enhance the algorithm's exploratory capabilities, this study develops an efficient method for training autonomous assembly behavior strategies. We have successfully established a 3C assembly simulation environment that accurately simulates key physical parameters such as position, contact force, and torque, modeling the assembly task as a Markov decision process. Considering the semi-flexible nature of FPC, we control the magnitude of adaptive contact force to achieve compliant assembly of FPCs. Comprehensive simulation experiments demonstrate that the SAC algorithm proposed in this study enables the robot to autonomously and obediently complete the 3C assembly tasks, exhibiting good accuracy and stability. The assembly success rate reaches 93 %, and after training with the reinforcement learning strategy, the contact force meets the preset range, achieving the effect of compliant assembly.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Deep-Reinforcement-Learning-Based Intelligent Routing Strategy for FANETs
    Lin, Deping
    Peng, Tao
    Zuo, Peiliang
    Wang, Wenbo
    SYMMETRY-BASEL, 2022, 14 (09):
  • [22] Research on epidemic tracking method based on reinforcement learning
    Guo, Siyuan
    Yan, Huaicheng
    Li, Yue
    Ke, Bai
    Li Zhichen
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1335 - 1340
  • [23] An Intersection Signal Control Method Based on Deep Reinforcement Learning
    Pang Ha-li
    Ding Ke
    2017 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION (ICICTA 2017), 2017, : 344 - 348
  • [24] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [25] An obstacles avoidance method for serial manipulator based on reinforcement learning and Artificial Potential Field
    Haoxuan Li
    Daoxiong Gong
    Jianjun Yu
    International Journal of Intelligent Robotics and Applications, 2021, 5 : 186 - 202
  • [26] An obstacles avoidance method for serial manipulator based on reinforcement learning and Artificial Potential Field
    Li, Haoxuan
    Gong, Daoxiong
    Yu, Jianjun
    INTERNATIONAL JOURNAL OF INTELLIGENT ROBOTICS AND APPLICATIONS, 2021, 5 (02) : 186 - 202
  • [27] Learning to Drive Like Human Beings: A Method Based on Deep Reinforcement Learning
    Tian, Yantao
    Cao, Xuanhao
    Huang, Kai
    Fei, Cong
    Zheng, Zhu
    Ji, Xuewu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 6357 - 6367
  • [28] Research on a Task Offloading Strategy for the Internet of Vehicles Based on Reinforcement Learning
    Xiao, Shuo
    Wang, Shengzhi
    Zhuang, Jiayu
    Wang, Tianyu
    Liu, Jiajia
    SENSORS, 2021, 21 (18)
  • [29] Research of Control Strategy of Power System Stabilizer Based on Reinforcement Learning
    Zhu, Xingyu
    Jin, Tao
    2020 IEEE 2ND INTERNATIONAL CONFERENCE ON CIRCUITS AND SYSTEMS (ICCS 2020), 2020, : 81 - 85
  • [30] Acceleration control strategy for aero-engines based on model-free deep reinforcement learning method
    Gao, Wenbo
    Zhou, Xin
    Pan, Muxuan
    Zhou, Wenxiang
    Lu, Feng
    Huang, Jinquan
    AEROSPACE SCIENCE AND TECHNOLOGY, 2022, 120