Optimization of Apparel Supply Chain Using Deep Reinforcement Learning

Cited by: 9

Authors:

Chong, Ji Won [1]

Kim, Wooju [1]

Hong, Jun Seok [2]

Affiliations:

[1] Yonsei Univ, Dept Ind Engn, Seoul 03722, South Korea

[2] Kyonggi Univ, Dept Management Informat Syst, Suwon 16227, Gyeonggi Do, South Korea

Source:

IEEE ACCESS | 2022 / Vol. 10

Keywords:

Inventory management; Optimization; Inventory control; Transportation; Supply chains; Supply chain management; Reinforcement learning; Deep learning; Markov processes; Deep reinforcement learning; inventory management; Markov decision process; supply chain management; soft actor critic

DOI:

10.1109/ACCESS.2022.3205720

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

An effective supply chain management system is indispensable in several respects for an enterprise with a supply chain network. In particular, organized control over the production and transportation of its products is a key success factor for the enterprise to stay active without damaging its reputation. This is also highly relevant to the garment industry. In this study, an extensive Deep Reinforcement Learning study of apparel supply chain optimization is proposed and undertaken, with a focus on Soft Actor-Critic. Six models are evaluated and compared with respect to sell-through rate, service level, and inventory-to-sales ratio. Soft Actor-Critic outperformed several other state-of-the-art Actor-Critic models in managing inventories and fulfilling demand. Furthermore, explicit indicators are calculated to assess the performance of the models in the experiment. Soft Actor-Critic achieved a better balance between service level and sell-through rate by ensuring higher availability of stock to sell without overstocking. Numerical experiments show that S-policy, Trust Region Policy Optimization, and Twin Delayed Deep Deterministic Policy Gradient have a good balance between service level and sell-through rate. Additionally, Soft Actor-Critic achieved a 7%, 41.6%, and 42.8% lower inventory-to-sales ratio than the S-policy, Twin Delayed Deep Deterministic Policy Gradient, and Trust Region Policy Optimization models, respectively, indicating its superior ability to make inventory available for sale and to profit from it.


Pages: 100367-100375

Number of pages: 9

