Transaction selection policy in tier-to-tier SBSRS by using Deep Q-Learning

被引:12
作者
Arslan, Bartu [1 ]
Ekren, Banu Yetkin [2 ,3 ]
机构
[1] Eindhoven Univ Technol, Dept Ind Engn & Innovat Sci, Eindhoven, Netherlands
[2] Yasar Univ, Dept Ind Engn, Izmir, Turkey
[3] Cranfield Univ, Sch Management, Cranfield, Beds, England
关键词
Logistics; SBSRS; automated warehousing; deep reinforcement learning; DQN; agent-based simulation; SHUTTLE-BASED STORAGE; AUTONOMOUS VEHICLE STORAGE; PERFORMANCE ESTIMATIONS; THROUGHPUT PERFORMANCE; RETRIEVAL-SYSTEMS; MODEL; TIME; DESIGN; LIFTS;
D O I
10.1080/00207543.2022.2148767
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper studies a Deep Q-Learning (DQL) method for transaction sequencing problems in an automated warehousing system, Shuttle-based Storage and Retrieval System (SBSRS), in which shuttles can move between tiers flexibly. Here, the system is referred to as tier-to-tier SBSRS (t-SBSRS), developed as an alternative design to tier-captive SBSRS (c-SBSRS). By the flexible travel of shuttles between tiers in t-SBSRS, the number of shuttles in the system may be reduced compared to its simulant c-SBSRS design. The flexible travel of shuttles makes the operation decisions more complex in that system, motivating us to explore whether integration of a machine learning approach would help to improve the system performance. We apply the DQL method for the transaction selection of shuttles in the system to attain process time advantage. The outcomes of the DQN are confronted with the well-applied heuristic approaches: first-come-first-serve (FIFO) and shortest process time (SPT) rules under different racking and numbers of shuttles scenarios. The results show that DQL outperforms the FIFO and SPT rules promising for the future of smart industry applications. Especially, compared to the well-applied SPT rule in industries, DQL improves the average cycle time per transaction by roughly 43% on average.
引用
收藏
页码:7353 / 7366
页数:14
相关论文
共 51 条
[1]  
Allied Market Research, 2020, AUT STOR RETR SYST M
[2]   Sequencing dynamic storage systems with multiple lifts and shuttles [J].
Carlo, Hector J. ;
Vis, Iris F. A. .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2012, 140 (02) :844-853
[3]  
Choudhary Ankit., 2019, Hands on Introduction to deep Q-learning using OpenAi Gym in python
[4]   Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses [J].
Dou, Jiajia ;
Chen, Chunlin ;
Yang, Pei .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[5]   An analytical approach for a performance calculation of shuttle-based storage and retrieval systems [J].
Eder, Michael .
PRODUCTION AND MANUFACTURING RESEARCH-AN OPEN ACCESS JOURNAL, 2019, 7 (01) :255-270
[6]   Warehouse Design under Class-Based Storage Policy of Shuttle-Based Storage and Retrieval System [J].
Ekren, B. Y. ;
Sari, Z. ;
Lerher, T. .
IFAC PAPERSONLINE, 2015, 48 (03) :1152-1154
[7]   An open queuing network-based tool for performance estimations in a shuttle-based storage and retrieval system [J].
Ekren, Banu Y. ;
Akpunar, Antl .
APPLIED MATHEMATICAL MODELLING, 2021, 89 :1678-1695
[8]   A simulation-based experimental design for SBS/RS warehouse design by considering energy related performance metrics [J].
Ekren, Banu Y. .
SIMULATION MODELLING PRACTICE AND THEORY, 2020, 98
[9]   A tool for time, variance and energy related performance estimations in a shuttle-based storage and retrieval system [J].
Ekren, Banu Y. ;
Akpunar, Anil ;
Sari, Zaki ;
Lerher, Tone .
APPLIED MATHEMATICAL MODELLING, 2018, 63 :109-127
[10]   An Approximate Solution for Semi-Open Queueing Network Model of an Autonomous Vehicle Storage and Retrieval System [J].
Ekren, Banu Y. ;
Heragu, Sunderesh S. ;
Krishnamurthy, Ananth ;
Malmborg, Charles J. .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2013, 10 (01) :205-215