Towards reliable robot packing system based on deep reinforcement learning

被引:20
作者
Xiong, Heng [1 ]
Ding, Kai [2 ]
Ding, Wan [2 ]
Peng, Jian [1 ]
Xu, Jianfeng [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
[2] BOSCH Corp Res, Shanghai 200335, Peoples R China
关键词
Robotics; Online bin packing; Reinforcement learning; Manipulation; BIN PACKING; ALGORITHM; HEURISTICS; PICKING;
D O I
10.1016/j.aei.2023.102028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object packing by a robot has a wide range of applications in the logistics industry. This task requires the variable size items to be picked from piles one by one and then packed into another container immediately without the information about the unpicked items, modeled as an Online 3D Bin Packing Problem (3D-BPP). Due to limited information, it is a challenging problem to obtain an optimal solution for maximizing space utilization. Furthermore, existing studies do not consider practical constraints and assume an ideal perception and robotic packing manipulation. In this paper, we present a robot packing system with high performance and reliability. First, the Online 3D-BPP is formulated as a Markov decision process. A deep reinforcement learning (DRL) approach is proposed to tackle the problem utilizing the observations of the container and the current item. Specifically, a candidate map that indicates the potentially feasible placements based on heuristics is introduced to balance the exploration and exploitation in the considerable discrete action space. Second, we develop a physical robotic system to bridge the DRL agent from simulation to practical application. To make the packing manipulation resilient to uncertainties from the physical system, we design a motion primitive by moving the picked item close to its target placement from a collision-free area within the container. Experiments demonstrate that our method delivers superior performance against the baselines on two datasets, improving space utilization by over 2.7% and 3.8%, respectively, and the performance is not limited by the container size. Moreover, our robotic system can facilitate DRL to perform well in the real world.
引用
收藏
页数:10
相关论文
共 47 条
[1]   Jampacker: An Efficient and Reliable Robotic Bin Packing System for Cuboid Objects [J].
Agarwal, Marichi ;
Biswas, Swagata ;
Sarkar, Chayan ;
Paul, Sayan ;
Paul, Himadri Sekhar .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) :319-326
[2]   On-line three-dimensional packing problems: A review of off-line and on-line solution approaches [J].
Ali, Sara ;
Ramos, Antonio Galrao ;
Carravilla, Maria Antonia ;
Oliveira, Jose Fernando .
COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 168
[3]   Managing computational complexity using surrogate models: a critical review [J].
Alizadeh, Reza ;
Allen, Janet K. ;
Mistree, Farrokh .
RESEARCH IN ENGINEERING DESIGN, 2020, 31 (03) :275-298
[4]   A flexible and collaborative approach to robotic box-filling and item sorting [J].
Balatti, Pietro ;
Leonori, Mattia ;
Ajoudani, Arash .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 146
[5]  
Brockman G, 2016, Arxiv, DOI arXiv:1606.01540
[6]   An Online Packing Heuristic for the Three-Dimensional Container Loading Problem in Dynamic Environments and the Physical Internet [J].
Chi Trung Ha ;
Trung Thanh Nguyen ;
Lam Thu Bui ;
Wang, Ran .
APPLICATIONS OF EVOLUTIONARY COMPUTATION (EVOAPPLICATIONS 2017), PT II, 2017, 10200 :140-155
[7]   Approximation and online algorithms for multidimensional bin packing: A survey [J].
Christensen, Henrik I. ;
Khan, Arindam ;
Pokutta, Sebastian ;
Tetali, Prasad .
COMPUTER SCIENCE REVIEW, 2017, 24 :63-79
[8]   Analysis and Observations From the First Amazon Picking Challenge [J].
Correll, Nikolaus ;
Bekris, Kostas E. ;
Berenson, Dmitry ;
Brock, Oliver ;
Causo, Albert ;
Hauser, Kris ;
Okada, Kei ;
Rodriguez, Alberto ;
Romano, Joseph M. ;
Wurman, Peter R. .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (01) :172-188
[9]   Extreme point-based heuristics for three-dimensional bin packing [J].
Crainic, Teodor Gabriel ;
Perboli, Guido ;
Tadei, Roberto .
INFORMS JOURNAL ON COMPUTING, 2008, 20 (03) :368-384
[10]   Practical constraints in the container loading problem: Comprehensive formulations and exact algorithm [J].
do Nascimento, Oliviana Xavier ;
de Queiroz, Thiago Alves ;
Junqueira, Leonardo .
COMPUTERS & OPERATIONS RESEARCH, 2021, 128