Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

被引:0
作者
Ma, Hang [1 ]
Zhang, Yuhang [1 ]
Li, Ziyang [1 ]
Zhang, Jiaqi [1 ]
Wu, Xibao [1 ]
Chen, Wenbai [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China
关键词
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE;
D O I
10.1016/j.compeleceng.2024.109605
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Addressing the issues of existing 3C assembly methods that rely on precise contact state models, low sampling efficiency, and poor safety, this paper proposes a research method for a manipulator-based 3C assembly strategy utilizing deep reinforcement learning. Initially, the study constructs a simulation task for 3C assembly involving a UR manipulator and flexible printed circuits (FPC) buckling within the MuJoCo development environment to mirror real-world assembly conditions. By incorporating a Gaussian distribution-based policy network suitable for continuous action spaces and employing the maximum entropy method to enhance the algorithm's exploratory capabilities, this study develops an efficient method for training autonomous assembly behavior strategies. We have successfully established a 3C assembly simulation environment that accurately simulates key physical parameters such as position, contact force, and torque, modeling the assembly task as a Markov decision process. Considering the semi-flexible nature of FPC, we control the magnitude of adaptive contact force to achieve compliant assembly of FPCs. Comprehensive simulation experiments demonstrate that the SAC algorithm proposed in this study enables the robot to autonomously and obediently complete the 3C assembly tasks, exhibiting good accuracy and stability. The assembly success rate reaches 93 %, and after training with the reinforcement learning strategy, the contact force meets the preset range, achieving the effect of compliant assembly.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Research on Method of Intelligent Radar Confrontation Based on Reinforcement Learning
    Xing Qiang
    Zhu Weigang
    Jia Xin
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 471 - 475
  • [42] Generalization ability of hybrid electric vehicle energy management strategy based on reinforcement learning method
    Qi, Chunyang
    Song, Chuanxue
    Xiao, Feng
    Song, Shixin
    ENERGY, 2022, 250
  • [43] Deep Reinforcement Learning-Based Method of Mobile Data Offloading
    Mochizuki, Daisuke
    Abiko, Yu
    Mineno, Hiroshi
    Saito, Takato
    Ikeda, Daizo
    Katagiri, Masaji
    2018 ELEVENTH INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND UBIQUITOUS NETWORK (ICMU 2018), 2018,
  • [44] A Bus Signal Priority Control Method Based on Deep Reinforcement Learning
    Shen, Wenchao
    Zou, Liang
    Deng, Ruisheng
    Wu, Hongyu
    Wu, Jiabin
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [45] A Satellite Adaptive Modulation Coding Method Based on Deep Reinforcement Learning
    Zhou, Xin
    Li, Wenfeng
    Zhao, Kanglian
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE INNOVATION, ICAII 2023, 2023, : 83 - 88
  • [46] An active equalization method for redundant battery based on deep reinforcement learning
    Lu, Chenlei
    Chen, Jianlong
    Chen, Cong
    Huang, Yin
    Xuan, Dongji
    MEASUREMENT, 2023, 210
  • [47] Adaptive Virtual Machine Consolidation Method Based on Deep Reinforcement Learning
    Yu X.
    Li Z.
    Sun S.
    Zhang G.
    Diao Z.
    Xie G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (12): : 2783 - 2797
  • [48] Emergency communication network planning method based on deep reinforcement learning
    Yin C.
    Yang R.
    Zhu W.
    Zou X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (09): : 2091 - 2097
  • [49] An Intellectual Aerodynamic Design Method for Compressors Based on Deep Reinforcement Learning
    Xu, Xiaohan
    Huang, Xudong
    Bi, Dianfang
    Zhou, Ming
    AEROSPACE, 2023, 10 (02)
  • [50] Design Method of Infrared Stealth Film Based on Deep Reinforcement Learning
    Zhang, Kunyuan
    Liu, Delian
    Yang, Shuo
    PHOTONICS, 2025, 12 (01)