A Reinforcement Learning Approach in Assignment of Task Priorities in Kinematic Control of Redundant Robots

被引:10
作者
Karimi, Masoud [1 ]
Ahmadi, Mojtaba [1 ]
机构
[1] Carleton Univ, Dept Mech & Aerosp Engn, Ottawa, ON K1S 5B6, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Machinelearning for robot control; reinforcement learning; redundant robots; bipedal locomotion;
D O I
10.1109/LRA.2021.3133934
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Based on the recent advances of Deep Reinforcement Learning (DRL) and promising results, in this paper, we propose a framework for strict priority assignment in the context of kinematic control of redundant robots. The presented method focuses on redundant robots performing multiple concurrent tasks with potentially conflicting requirements and learns how to re-assign task priorities to ensure critical tasks get executed through smooth transitions. A Deep Q-Network (DQN) reinforcement learning agent is trained to assign the proper strict priorities to a stack of predefined kinematic control tasks (e.g., position control, orientation control, obstacle avoidance control, etc.) in a varying environment. Furthermore, to address the discontinuities in the control law due to the changes in the task priorities, a smoothing algorithm is proposed to ensure continuous reference velocities to the robot's joints. The proposed method is generic and extendable to a higher number of tasks and can be used when a reordering, swapping, addition, or deletion of tasks is required. The effectiveness of the proposed method is shown in simulation on a 5-DoF planar manipulator and a 7-DoF planar bipedal robot. The results show that the DRL agent is successful in assigning the correct hierarchy of tasks at each robot's state based on the global goal of the robot.
引用
收藏
页码:850 / 857
页数:8
相关论文
共 29 条
[1]  
Baerlocher P, 1998, 1998 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - PROCEEDINGS, VOLS 1-3, P323, DOI 10.1109/IROS.1998.724639
[2]   A Behavior-Based Reinforcement Learning Approach to Control Walking Bipedal Robots Under Unknown Disturbances [J].
Beranek, Richard ;
Karimi, Masoud ;
Ahmadi, Mojtaba .
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (05) :2710-2720
[3]  
Dai T., 2019, ARXIV PREPRINT ARXIV
[4]  
Dehio N, 2016, IEEE-RAS INT C HUMAN, P264, DOI 10.1109/HUMANOIDS.2016.7803287
[5]   Underwater Intervention With Remote Supervision via Satellite Communication: Developed Control Architecture and Experimental Results Within the Dexrov Project [J].
Di Lillo, Paolo ;
Simetti, Enrico ;
Wanderlingh, Francesco ;
Casalino, Giuseppe ;
Antonelli, Gianluca .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2021, 29 (01) :108-123
[6]   A Comparison of Damped Least Squares Algorithms for Inverse Kinematics of Robot Manipulators [J].
Di Vito, Daniele ;
Natale, Ciro ;
Antonelli, Gianluca .
IFAC PAPERSONLINE, 2017, 50 (01) :6869-6874
[7]  
Flacco F, 2014, IEEE INT C INT ROBOT, P2421, DOI 10.1109/IROS.2014.6942891
[8]   Pareto Optimality and Multiobjective Trajectory Planning for a 7-DOF Redundant Manipulator [J].
Guigue, Alexis ;
Ahmadi, Mojtaba ;
Langlois, Rob ;
Hayes, M. John D. .
IEEE TRANSACTIONS ON ROBOTICS, 2010, 26 (06) :1094-1099
[9]   Learning agile and dynamic motor skills for legged robots [J].
Hwangbo, Jemin ;
Lee, Joonho ;
Dosovitskiy, Alexey ;
Bellicoso, Dario ;
Tsounis, Vassilios ;
Koltun, Vladlen ;
Hutter, Marco .
SCIENCE ROBOTICS, 2019, 4 (26)
[10]   Analysis and Transfer of Human Movement Manipulability in Industry-like Activities [J].
Jaquier, Noemie ;
Rozo, Leonel ;
Calinon, Sylvain .
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, :11131-11138