Efficient Reinforcement Learning of Task Planners for Robotic Palletization Through Iterative Action Masking Learning

Citations: 0

Authors:

Wu, Zheng [1]

Li, Yichuan [2]

Zhan, Wei [1]

Liu, Changliu [3]

Liu, Yun-Hui [2]

Tomizuka, Masayoshi [1]

Affiliations:

[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 92093 USA

[2] Chinese Univ Hong Kong, T Stone Robot Inst, Dept Mech & Automat Engn, Hong Kong, Peoples R China

[3] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA

Source:

IEEE ROBOTICS AND AUTOMATION LETTERS | 2024 / Vol. 9 / Issue 11

Keywords:

Three-dimensional displays; Robots; Task analysis; Pallets; Planning; Training; Thermal stability; Action space masking; reinforcement learning; robotic palletization; PACKING;

DOI:

10.1109/LRA.2024.3440731

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification code:

080202 ; 1405 ;

Abstract:

The development of robotic systems for palletization in logistics scenarios is of paramount importance, addressing critical efficiency and precision demands in supply chain management. This paper investigates the application of Reinforcement Learning (RL) to enhance task planning for such robotic systems. Confronted with the substantial challenge of a vast action space, which is a significant impediment to efficiently applying off-the-shelf RL methods, our study introduces a novel method that uses supervised learning to iteratively prune and manage the action space. By reducing the complexity of the action space, our approach not only accelerates the learning phase but also ensures the effectiveness and reliability of task planning in robotic palletization. The experimental results underscore the efficacy of this method, highlighting its potential for improving the performance of RL applications in complex, high-dimensional environments such as logistics palletization.


Pages: 9303-9310

Page count: 8
