Guided deep reinforcement learning framework using automated curriculum scheme for accurate motion planning

Cited by: 1

Authors:

Cho, Deun-Sol [1]

Cho, Jae-Min [1]

Kim, Won-Tae [2]

Affiliations:

[1] Koreatech Univ, Major Future Convergence Engn, Cheonan Si 31253, South Korea

[2] Koreatech Univ, Dept Comp Sci Engn, Cheonan Si 31253, South Korea

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 139

Keywords:

Curriculum learning; Deep reinforcement learning; Motion planning; Robotic arm; Unsupervised learning

DOI:

10.1016/j.engappai.2024.109541

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Collaborative robotic arms in smart factories must ensure safety and interactivity during operations such as reaching for and grasping objects. In particular, an advanced motion planner that combines path planning and motion control is essential for human-machine co-working. Because traditional physics-based motion planning approaches require extreme computational resources to obtain near-optimal solutions, deep reinforcement learning algorithms have been actively adopted and have effectively addressed this limitation. They suffer, however, from an easy-task preference problem: because agents are trained on randomly chosen target points in a large-scale search space, they primarily learn the simpler tasks that yield more reward. We therefore propose a novel curriculum-based deep reinforcement learning framework in which agents learn motion planning tasks in an unbiased way, progressing from low-complexity tasks to high-complexity ones. The framework uses unsupervised learning algorithms to cluster target points of similar task complexity and thereby generate an effective curriculum. In addition, review and buffer-flushing mechanisms are integrated into the framework to mitigate catastrophic forgetting, in which the agent abruptly loses previously learned knowledge when it learns new tasks in the curriculum. The evaluation results show that the curriculum significantly raises the success rate on the highest-complexity task from 12% to 56%, and that the mechanisms improve the success rate on the easier tasks from an average of 66% to 76.5%, despite requiring less training time.
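The abstract does not say which clustering algorithm, complexity measure, or buffer policy the framework uses, so the following is only a minimal illustrative sketch of the general idea, not the authors' implementation. It assumes a small k-means as the unsupervised step, Euclidean distance from a hypothetical start position as a stand-in for task complexity, a fixed review probability, and a replay buffer that is emptied whenever the curriculum advances. All function, class, and parameter names are hypothetical.

```python
import math
import random
from collections import deque


def kmeans(points, k, iters=20):
    """Tiny Lloyd's k-means on 3-D points (stand-in for any clustering method)."""
    # Deterministic seeding: spread initial centroids over points sorted by norm.
    ordered = sorted(points, key=lambda p: math.dist(p, (0.0, 0.0, 0.0)))
    step = max(1, len(ordered) // k)
    centroids = [ordered[min(i * step + step // 2, len(ordered) - 1)] for i in range(k)]
    clusters = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        centroids = [
            tuple(sum(axis) / len(c) for axis in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return [c for c in clusters if c]  # drop empty clusters


def build_curriculum(targets, k, start):
    """Order clusters easy-to-hard by mean distance from `start` (assumed proxy)."""
    def mean_dist(cluster):
        return sum(math.dist(p, start) for p in cluster) / len(cluster)
    return sorted(kmeans(targets, k), key=mean_dist)


def sample_target(curriculum, stage, review_prob, rng):
    """Pick a target from the current stage, or review an earlier stage."""
    if stage > 0 and rng.random() < review_prob:
        stage = rng.randrange(stage)  # review: revisit an already-learned stage
    return rng.choice(curriculum[stage])


class ReplayBuffer:
    """Replay buffer that can be flushed when the curriculum advances."""

    def __init__(self, capacity):
        self.data = deque(maxlen=capacity)

    def add(self, transition):
        self.data.append(transition)

    def flush(self):
        self.data.clear()


# Synthetic target points in three distance bands (illustrative data only).
rng = random.Random(0)
targets = [(r + rng.uniform(-0.05, 0.05), rng.uniform(-0.05, 0.05), 0.3)
           for r in (0.2, 0.6, 1.0) for _ in range(20)]
curriculum = build_curriculum(targets, k=3, start=(0.0, 0.0, 0.3))
buffer = ReplayBuffer(capacity=1000)
for stage in range(len(curriculum)):
    buffer.flush()  # drop transitions collected on the previous stage
    for _ in range(5):
        goal = sample_target(curriculum, stage, review_prob=0.2, rng=rng)
        buffer.add((stage, goal))  # placeholder for a real (s, a, r, s') transition
```

In a real training loop the placeholder tuple would be replaced by transitions from the agent's rollouts, and the stage would advance on a success-rate threshold rather than after a fixed number of samples.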


Pages: 26
