Efficient Reinforcement Learning of Task Planners for Robotic Palletization Through Iterative Action Masking Learning

被引:0
|
作者
Wu, Zheng [1 ]
Li, Yichuan [2 ]
Zhan, Wei [1 ]
Liu, Changliu [3 ]
Liu, Yun-Hui [2 ]
Tomizuka, Masayoshi [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 92093 USA
[2] Chinese Univ Hong Kong, T Stone Robot Inst, Dept Mech & Automat Engn, Hong Kong, Peoples R China
[3] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 11期
关键词
Three-dimensional displays; Robots; Task analysis; Pallets; Planning; Training; Thermal stability; Action space masking; reinforcement learning; robotic palletization; PACKING;
D O I
10.1109/LRA.2024.3440731
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
The development of robotic systems for palletization in logistics scenarios is of paramount importance, addressing critical efficiency and precision demands in supply chain management. This paper investigates the application of Reinforcement Learning (RL) in enhancing task planning for such robotic systems. Confronted with the substantial challenge of a vast action space, which is a significant impediment to efficiently apply out-of-the-shelf RL methods, our study introduces a novel method of utilizing supervised learning to iteratively prune and manage the action space effectively. By reducing the complexity of the action space, our approach not only accelerates the learning phase but also ensures the effectiveness and reliability of the task planning in robotic palletization. The experiemental results underscore the efficacy of this method, highlighting its potential in improving the performance of RL applications in complex and high-dimensional environments like logistics palletization.
引用
收藏
页码:9303 / 9310
页数:8
相关论文
共 50 条
  • [21] Adaptive Curriculum Learning: Optimizing Reinforcement Learning through Dynamic Task Sequencing
    Nesterova, M.
    Skrynnik, A.
    Panov, A.
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (SUPPL3) : S435 - S444
  • [22] Adjacency Constraint for Efficient Hierarchical Reinforcement Learning
    Zhang, Tianren
    Guo, Shangqi
    Tan, Tian
    Hu, Xiaolin
    Chen, Feng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4152 - 4166
  • [23] Heterogeneous Multi-robot Task Allocation and Scheduling via Reinforcement Learning
    Dai, Weiheng
    Rai, Utkarsh
    Chiun, Jimmy
    Cao, Yuhong
    Sartoretti, Guillaume
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2654 - 2661
  • [24] Multi-Task Reinforcement Learning With Attention-Based Mixture of Experts
    Cheng, Guangran
    Dong, Lu
    Cai, Wenzhe
    Sun, Changyin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (06) : 3811 - 3818
  • [25] Vehicular Cooperative Perception Through Action Branching and Federated Reinforcement Learning
    Abdel-Aziz, Mohamed K.
    Perfecto, Cristina
    Samarakoon, Sumudu
    Bennis, Mehdi
    Saad, Walid
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (02) : 891 - 903
  • [26] Data-Efficient Task Generalization via Probabilistic Model-Based Meta Reinforcement Learning
    Bhardwaj, Arjun
    Rothfuss, Jonas
    Sukhija, Bhavya
    As, Yarden
    Hutter, Marco
    Coros, Stelian
    Krause, Andreas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3918 - 3925
  • [27] Safe and Sample-Efficient Reinforcement Learning for Clustered Dynamic Environments
    Chen, Hongyi
    Liu, Changliu
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 1928 - 1933
  • [28] Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play
    Brandao, Bruno
    De Lima, Telma Woerle
    Soares, Anderson
    Melo, Luckeciano
    Maximo, Marcos R. O. A.
    IEEE ACCESS, 2022, 10 : 72628 - 72642
  • [29] Obstacle-Avoidable Robotic Motion Planning Framework Based on Deep Reinforcement Learning
    Liu, Huashan
    Ying, Fengkang
    Jiang, Rongxin
    Shan, Yinghao
    Shen, Bo
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2024, 29 (06) : 4377 - 4388
  • [30] Adapting Virtual Embodiment Through Reinforcement Learning
    Porssut, Thibault
    Hou, Yawen
    Blanke, Olaf
    Herbelin, Bruno
    Boulic, Ronan
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (09) : 3193 - 3205