A Vision-Based Bio-Inspired Reinforcement Learning Algorithms for Manipulator Obstacle Avoidance

被引：3

作者：

Singh, Abhilasha ^{[1
]}

Shakeel, Mohamed ^{[2
]}

Kalaichelvi, V ^{[1
]}

Karthikeyan, R. ^{[2
]}

机构：

[1] Birla Inst Technol & Sci Pilani, Dept Elect & Elect Engn, Dubai Campus,POB 345 055, Dubai, U Arab Emirates

[2] Birla Inst Technol & Sci Pilani, Dept Mech Engn, Dubai Campus,POB 345 055, Dubai, U Arab Emirates

来源：

ELECTRONICS | 2022年 / 11卷 / 21期

关键词：

Q-learning; DQN; SARSA; DDQN; homogeneous transformation; optimization; obstacle avoidance; MOBILE ROBOT; ENVIRONMENTS;

D O I：

10.3390/electronics11213636

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Path planning for robotic manipulators has proven to be a challenging issue in industrial applications. Despite providing precise waypoints, the traditional path planning algorithm requires a predefined map and is ineffective in complex, unknown environments. Reinforcement learning techniques can be used in cases where there is a no environmental map. For vision-based path planning and obstacle avoidance in assembly line operations, this study introduces various Reinforcement Learning (RL) algorithms based on discrete state-action space, such as Q-Learning, Deep Q Network (DQN), State-Action-Reward-State-Action (SARSA), and Double Deep Q Network (DDQN). By positioning the camera in an eye-to-hand position, this work used color-based segmentation to identify the locations of obstacles, start, and goal points. The homogeneous transformation technique was used to further convert the pixel values into robot coordinates. Furthermore, by adjusting the number of episodes, steps per episode, learning rate, and discount factor, a performance study of several RL algorithms was carried out. To further tune the training hyperparameters, genetic algorithms (GA) and particle swarm optimization (PSO) were employed. The length of the path travelled, the average reward, the average number of steps, and the time required to reach the objective point were all measured and compared for each of the test cases. Finally, the suggested methodology was evaluated using a live camera that recorded the robot workspace in real-time. The ideal path was then drawn using a TAL BRABO 5 DOF manipulator. It was concluded that waypoints obtained via Double DQN showed an improved performance and were able to avoid the obstacles and reach the goal point smoothly and efficiently.

引用

页数：26

共 38 条

[21]

manufacturing-today, PROFILE TAL MANUFACT

[22]

Mohan Prajval, 2021, 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS), P811, DOI 10.1109/ICICCS51141.2021.9432202

[23]

Paavai Anand P., 2021, International Journal Of Computing and Digital System., V11, P1

[24]

Pouyan M, 2014, 2014 INTERNATIONAL CONGRESS ON TECHNOLOGY, COMMUNICATION AND KNOWLEDGE (ICTCK)

[25] A novel mobile robot navigation method based on deep reinforcement learning [J].

Quan, Hao ;

Li, Yansheng ;

Zhang, Yi .

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)

[26]

Sanyal Alok, 2021, Advances in Interdisciplinary Engineering. Select Proceedings of FLAME 2020. Lecture Notes in Mechanical Engineering (LNME), P555, DOI 10.1007/978-981-15-9956-9_55

[27]

Shukla P, 2018, 2018 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT'18)

[28]

Siyu Zhou, 2018, 2018 IEEE International Conference on Information and Automation (ICIA). Proceedings, P366, DOI 10.1109/ICInfA.2018.8812452

[29]

Sutton RS, 2018, ADAPT COMPUT MACH LE, P1

[30] Path Planning for Mobile Robot Navigation in Unknown Indoor Environments Using Hybrid PSOFS Algorithm [J].

Wahab, Mohd Nadhir Ab ;

Lee, Ching May ;

Akbar, Muhammad Firdaus ;

Hassan, Fadratul Hafinaz .

IEEE ACCESS, 2020, 8 :161805-161815

← 1 2 3 4 →