Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward

被引：0

作者：

Crescitelli, Viviana ^{[1
]}

Oshima, Takashi ^{[1
]}

机构：

[1] Hitachi Ltd, Res & Dev Grp, Tokyo, Japan

来源：

INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING | 2024年 / 18卷 / 03期

关键词：

Irregular packing; reinforcement learning; factory automation; machine learning; reward; ALGORITHM;

D O I：

10.1142/S1793351X24430025

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper introduces a method to solve the 2D irregular packing problem using Deep Reinforcement Learning (Deep RL) for logistics. Our method employs a Q agent trained to predict the best placement within a container, maximizing available space. Unlike previous Deep RL algorithms, our method introduces a dense reward function at each packing step, providing immediate feedback and accelerating learning. To our knowledge, this is the first approach to use a dense reward to address the 2D irregular packing problem. Building on our earlier work, we improve the deep neural network by incorporating the Double Deep Q-Network (DDQN) framework to enhance our deep Q-learning approach, reducing overestimation biases and improving decision-making reliability. Simulation results show the method's effectiveness in completing the online 2D irregular packing tasks, achieving promising volume efficiency and packed piece metrics. This research extends our initial findings, highlighting the practical importance of DDQN and dense reward in advancing 2D irregular packing problem-solving. These advancements not only broaden the applications of deep learning but also hold practical importance for real-world logistics challenges.

引用

页码：405 / 416

页数：12

共 26 条

[1] MINIMIZING TRIM LOSS IN CUTTING RECTANGULAR BLANKS OF A SINGLE SIZE FROM A RECTANGULAR SHEET USING ORTHOGONAL GUILLOTINE CUTS
AGRAWAL, PK
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1993, 64 (03) : 410 - 422
[2] [Anonymous], 2016, INT J COMPUT INTEG M, V29, P1177
[3] A tutorial in irregular shape packing problems
Bennell, J. A.
Oliveira, J. F.
[J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 : S93 - S105
[4] A new bottom-left-fill heuristic algorithm for the two-dimensional irregular packing problem
Burke, Edmund
Hellier, Robert
Kendall, Graham
Whitwell, Glenn
[J]. OPERATIONS RESEARCH, 2006, 54 (03) : 587 - 601
[5] THE BOTTOM-LEFT BIN-PACKING HEURISTIC - AN EFFICIENT IMPLEMENTATION
CHAZELLE, B
[J]. IEEE TRANSACTIONS ON COMPUTERS, 1983, 32 (08) : 697 - 707
[6] COFFMAN EG, 1980, SIAM J COMPUT, V9, P808, DOI 10.1137/0209062
[7] CRESCITELLI S, 2021, 2021 26 IEEE INT C E, P1
[8] Crescitelli Viviana, 2023, 2023 Fifth International Conference on Transdisciplinary AI (TransAI), P270, DOI 10.1109/TransAI60598.2023.00037
[9] Csiszar Akos, 2021, 2021 4th International Conference on Artificial Intelligence for Industries (AI4I), P57, DOI 10.1109/AI4I51902.2021.00022
[10] A Hybrid Reinforcement Learning Algorithm for 2D Irregular Packing Problems
Fang, Jie
Rao, Yunqing
Zhao, Xusheng
Du, Bing
[J]. MATHEMATICS, 2023, 11 (02)

← 1 2 3 →