Cable SCARA Robot Controlled by a Neural Network Using Reinforcement Learning

被引:0
作者
Okabe, Eduardo [1 ]
Paiva, Victor [2 ]
Silva-Teixeira, Luis H. [3 ]
Izuka, Jaime [4 ]
机构
[1] Univ Estadual Campinas, Sch Appl Sci, Rua Pedro Zaccaria 1300, BR-13484350 Limeira, Brazil
[2] Univ Estadual Campinas, Sch Mech Engn, Dept Integrated Syst, Rua Mendeleyev 200, BR-13083860 Campinas, Brazil
[3] Univ Estadual Campinas, Sch Mech Engn, Dept Integrated Syst, R Mendeleyev 200, BR-13083860 Campinas, SP, Brazil
[4] Univ Estadual Campinas, Sch Appl Sci, Rua Pedro Zaccaria 1300, BR-13484350 Limeira, SP, Brazil
来源
JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS | 2023年 / 18卷 / 10期
关键词
Compendex;
D O I
10.1115/1.4063222
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
In this work, three reinforcement learning algorithms (Proximal Policy Optimization, Soft Actor-Critic, and Twin Delayed Deep Deterministic Policy Gradient) are employed to control a two link selective compliance articulated robot arm (SCARA) robot. This robot has three cables attached to its end-effector, which creates a triangular shaped workspace. Positioning the end-effector in the workspace is a relatively simple kinematic problem, but moving outside this region, although possible, requires a nonlinear dynamic model and a state-of-the-art controller. To solve this problem in a simple manner, reinforcement learning algorithms are used to find possible trajectories for three targets out of the workspace. Additionally, the SCARA mechanism offers two possible configurations for each end-effector position. The algorithm results are compared in terms of displacement error, velocity, and standard deviation among ten trajectories provided by the trained network. The results indicate the Proximal Policy Algorithm as the most consistent in the analyzed situations. Still, the Soft Actor-Critic presented better solutions, and Twin Delayed Deep Deterministic Policy Gradient provided interesting and more unusual trajectories.
引用
收藏
页数:7
相关论文
共 21 条
  • [1] Wrench-feasible workspace generation for cable-driven robots
    Bosscher, Paul
    Riechel, Andrew T.
    Ebert-Uphoff, Imme
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2006, 22 (05) : 890 - 902
  • [2] Brockman G, 2016, Arxiv, DOI [arXiv:1606.01540, DOI 10.48550/ARXIV.1606.01540]
  • [3] Mathematical modelling, simulation and experimental verification of a scara robot
    Das, MT
    Dülger, LC
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2005, 13 (03) : 257 - 271
  • [4] Ebert David S, 2003, Texturing & Modeling: A Procedural Approach
  • [5] Fujimoto S, 2018, PR MACH LEARN RES, V80
  • [6] Synthesis of the Inverse Kinematic Model of Non-Redundant Open-Chain Robotic Systems Using Groebner Basis Theory
    Guzman-Gimenez, Jose
    Valera Fernandez, Angel
    Mata Amela, Vicente
    Angel Diaz-Rodriguez, Miguel
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [7] Haarnoja T, 2018, PR MACH LEARN RES, V80
  • [8] Hindmarsh AC, 2005, Lsoda, ordinary differential equation solver for stiff or non-stiff system
  • [9] OpenAI, 2022, INTR CHATGPT
  • [10] Schulman J., 2017, ARXIV, DOI DOI 10.48550/ARXIV.1707.06347