Deep reinforcement learning-based ordering mechanism for performance optimization in multi-echelon supply chains

被引：3

作者：

Kurian, Dony S. ^{[1
]}

Pillai, V. Madhusudanan ^{[1
]}

Raut, Akash ^{[2
]}

Gautham, J. ^{[1
]}

机构：

[1] Natl Inst Technol Calicut, Dept Mech Engn, Kozhikode 673601, India

[2] Natl Inst Technol Calicut, Dept Elect Engn, Kozhikode, India

来源：

APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY | 2024年 / 40卷 / 05期

关键词：

beer distribution game; deep reinforcement learning; multi-agent system; proximal policy optimization; supply chain; BEER GAME; DECISION-MAKING; MANAGEMENT; BEHAVIOR; POLICIES; MODEL;

D O I：

10.1002/asmb.2723

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

The need for self-adaptive and intelligent supply chain systems is essential to meet the challenges of the current global markets. Despite the recent breakthroughs in artificial intelligence, literature still lacks the application of state-of-the-art methods to optimize the performance of supply chain ordering management problems. Thus, this paper proposes a relatively new Deep Reinforcement Learning-based Ordering Mechanism (DRLOM) for multi-echelon linear supply chain systems. Initially, the supply chain ordering management problem is formulated as an agent-based reinforcement learning model and, afterwards, solved using a recently developed policy-based algorithm called proximal policy optimization. The proposed approach (DRLOM) aids the assumed supply chain echelons, such as the Retailer, the Wholesaler, the Distributor and the Factory, to learn the optimal/near-optimal dynamic strategies for inventory ordering systems. The experimental results also validate that the proposed approach efficiently minimizes the system-wide total accumulated inventory costs under different problem instances than other ordering heuristics and evolutionary computation methods. Throughout this paper, benchmark findings from the literature are used to evaluate the performance of the proposed approach. Furthermore, limitations of the earlier works are addressed through this paper and contribute to the supply chain ordering management literature.

引用

页码：1433 / 1454

页数：22

共 52 条

[1] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[2] Recent Advances in Hierarchical Reinforcement Learning [J].

Andrew G. Barto ;

Sridhar Mahadevan .

Discrete Event Dynamic Systems, 2003, 13 (4) :341-379

[3]

Bharti Shraddha, 2020, Innovative Product Design and Intelligent Manufacturing Systems. Select Proceedings of ICIPDIMS. Lecture Notes in Mechanical Engineering (LNME), P877, DOI 10.1007/978-981-15-2696-1_85

[4]

Brockman G., 2016, OPENAI GYM ARXIV PRE

[5] A reinforcement learning model for supply chain ordering management: An application to the beer game [J].

Chaharsooghi, S. Kamal ;

Heydari, Jafar ;

Zegordi, S. Hessameddin .

DECISION SUPPORT SYSTEMS, 2008, 45 (04) :949-959

[6] Decentralized supply chains subject to information delays [J].

Chen, FG .

MANAGEMENT SCIENCE, 1999, 45 (08) :1076-1090

[7] Stock allocation in a two-echelon distribution system controlled by (s, S) policies [J].

Chen, Haoxun ;

Dai, Bo ;

Li, Yuan ;

Zhang, Yidong ;

Wang, Xiaoqing ;

Deng, Yuming .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (03) :894-911

[8] Deep reinforcement learning for selecting demand forecast models to empower Industry 3.5 and an empirical study for a semiconductor component distributor [J].

Chien, Chen-Fu ;

Lin, Yun-Siang ;

Lin, Sheng-Kai .

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2020, 58 (09) :2784-2804

[9] Integrated planning and scheduling under production uncertainties: Bi-level model formulation and hybrid solution method [J].

Chu, Yunfei ;

You, Fengqi ;

Wassick, John M. ;

Agarwal, Anshul .

COMPUTERS & CHEMICAL ENGINEERING, 2015, 72 :255-272

[10] OPTIMAL POLICIES FOR A MULTI-ECHELON INVENTORY PROBLEM [J].

CLARK, AJ ;

SCARF, H .

MANAGEMENT SCIENCE, 1960, 6 (04) :475-490

← 1 2 3 4 5 6 →