Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system

被引:78
作者
Jiang, Chengzhi [1 ]
Sheng, Zhaohan [1 ]
机构
[1] Nanjing Univ, Dept Management & Engn, Nanjing 210093, Peoples R China
关键词
Inventory control; Reinforcement learning; Supply-chain management; Multi-agent simulation;
D O I
10.1016/j.eswa.2008.07.036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) appeals to many researchers in recent years because of its generality. It is an approach to machine intelligence that learns to achieve the given goal by trial-and-error iterations with its environment. This paper proposes a case-based reinforcement learning algorithm (CRL) for dynamic inventory control in a multi-agent supply-chain system. Traditional time-triggered and event-triggered ordering policies remain popular because they are easy to implement. But in the dynamic environment, the results of them may become inaccurate causing excessive inventory (cost) or shortage. Under the condition of nonstationary, customer demand, the S value of (T, S) and (Q, S) inventory review method is learnt using the proposed algorithm for satisfying target service level, respectively. Multi-agent simulation of a simplified two-echelon supply chain, where proposed algorithm is implemented, is run for a few times. The results show the effectiveness of CRL in both review methods. We also consider a framework for general learning method based on proposed one, which may be helpful in all aspects of supply-chain management (SCM). Hence, it is suggested that well-designed "connections" are necessary to be built between CRL, multi-agent system (MAS) and SCM. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6520 / 6526
页数:7
相关论文
共 18 条
[1]   Decentralized inventory control in a two-level distribution system [J].
Andersson, J ;
Marklund, J .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2000, 127 (03) :483-506
[2]  
[Anonymous], 1996, REINFORCEMENT LEARNI
[3]   Inventory control of spare parts using a Bayesian approach: A case study [J].
Aronis, KP ;
Magou, L ;
Dekker, R ;
Tagaras, G .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 154 (03) :730-739
[4]   Cyclic production-inventory planning and control in the pre-Deco industry: A case study [J].
Ashayeri, J. ;
Heuts, R. J. M. ;
Lansdaal, H. G. L. ;
Strijbosch, L. W. G. .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2006, 103 (02) :715-725
[5]  
Chen Y, 2006, COMPUTERS OPERATIONS, V35, P776
[6]   Modeling and optimizing a vendor managed replenishment system using machine learning and genetic algorithms [J].
Chi, Hoi-Ming ;
Ersoy, Okan K. ;
Moskowitz, Herbert ;
Ward, Jim .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 180 (01) :174-193
[7]   Optimal control policy for a standing order inventory system [J].
Chiang, Chi .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 182 (02) :695-703
[8]   Inventory control of particulate processes [J].
Diez, Marta Duenas ;
Ydstie, B. Erik ;
Fjeld, Magne ;
Lie, Bernt .
COMPUTERS & CHEMICAL ENGINEERING, 2008, 32 (1-2) :46-67
[9]  
ELHAFSI M, 2007, EUR J OPER RES, DOI DOI 10.1016/J.EJOR.2007.12.00
[10]   MASCF: A generic process-centered methodological framework for analysis and design of multi-agent supply chain systems [J].
Govindu, Ramakrishna ;
Chinnam, Ratna Babu .
COMPUTERS & INDUSTRIAL ENGINEERING, 2007, 53 (04) :584-609