Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management

被引:0
|
作者
Liu, Xiaotian [1 ]
Hu, Ming [2 ]
Peng, Yijie [3 ]
Yang, Yaodong [4 ]
机构
[1] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
[2] Univ Toronto, Rotman Sch Management, Toronto, ON M5S 3E6, Canada
[3] Peking Univ, PKU Wuhan Inst Artificial Intelligence, Guanghua Sch Management, Xiangjiang Lab, Beijing, Peoples R China
[4] Peking Univ, Inst Artificial Intelligence, PKU Wuhan Inst Artificial Intelligence, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Multi-Echelon Inventory Management; Multi-Agent Reinforcement Learning; Bullwhip Effect; OPTIMAL POLICIES; OPTIMALITY;
D O I
10.1177/10591478241305863
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We apply heterogeneous-agent proximal policy optimization (HAPPO), a multi-agent deep reinforcement learning (MADRL) algorithm, to the decentralized multi-echelon inventory management problems in both a serial supply chain and a supply chain network. We also examine whether the upfront-only information-sharing mechanism used in MADRL helps alleviate the bullwhip effect. Our results show that policies constructed by HAPPO achieve lower overall costs than policies constructed by single-agent deep reinforcement learning and other heuristic policies. Also, the application of HAPPO results in a less significant bullwhip effect than policies constructed by single-agent deep reinforcement learning where information is not shared among actors. Somewhat surprisingly, compared to using the overall costs of the system as a minimization target for each actor, HAPPO achieves lower overall costs when the minimization target for each actor is a combination of its own costs and the overall costs of the system. Our results provide a new perspective on the benefit of information sharing inside the supply chain that helps alleviate the bullwhip effect and improve the overall performance of the system. Upfront information sharing and action coordination in model training among actors is essential, with the former even more essential, for improving a supply chain's overall performance when applying MADRL. Neither actors being fully self-interested nor actors being fully system-focused leads to the best practical performance of policies learned and constructed by MADRL. Our results also verify MADRL's potential in solving various multi-echelon inventory management problems with complex supply chain structures and in non-stationary market environments.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Multi-Agent Reinforcement Learning for Edge Resource Management with Reconstructed Environment
    Miao, Weiwei
    Zeng, Zeng
    Zhang, Mingxuan
    Quan, Siping
    Zhang, Zhen
    Li, Shihao
    Zhang, Li
    Sun, Qi
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 1729 - 1736
  • [42] Demand selection decisions for a multi-echelon inventory distribution system
    Shu, Jia
    Li, Zhengyi
    Huang, Liya
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2013, 64 (09) : 1307 - 1313
  • [43] Learning structured communication for multi-agent reinforcement learning
    Sheng, Junjie
    Wang, Xiangfeng
    Jin, Bo
    Yan, Junchi
    Li, Wenhao
    Chang, Tsung-Hui
    Wang, Jun
    Zha, Hongyuan
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2022, 36 (02)
  • [44] Multi-Agent Reinforcement Learning for Traffic Flow Management of Autonomous Vehicles
    Mushtaq, Anum
    Ul Haq, Irfan
    Sarwar, Muhammad Azeem
    Khan, Asifullah
    Khalil, Wajeeha
    Mughal, Muhammad Abid
    SENSORS, 2023, 23 (05)
  • [45] A Kanban based system for multi-echelon inventory management The case of pharmaceutical supply chain
    Mouaky, Malak
    Berrado, Abdelaziz
    Benabbou, Loubna
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON LOGISTICS OPERATIONS MANAGEMENT (GOL'16), 2016,
  • [46] Multi-agent reinforcement learning for character control
    Li, Cheng
    Fussell, Levi
    Komura, Taku
    VISUAL COMPUTER, 2021, 37 (12) : 3115 - 3123
  • [47] Leveraging Multi-Agent Reinforcement Learning for Digital Transformation in Supply Chain Inventory Optimization
    Zhang, Bo
    Tan, Wen Jun
    Cai, Wentong
    Zhang, Allan N.
    SUSTAINABILITY, 2024, 16 (22)
  • [48] Cooperative Multi-Agent Reinforcement Learning with Conversation Knowledge for Dialogue Management
    Lei, Shuyu
    Wang, Xiaojie
    Yuan, Caixia
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [49] A multi-agent reinforcement learning model for inventory transshipments under supply chain disruption
    Kim, Byeongmok
    Kim, Jong Gwang
    Lee, Seokcheon
    IISE TRANSACTIONS, 2024, 56 (07) : 715 - 728
  • [50] Multi-Agent Deep Reinforcement Learning for Spectrum Management in V2X with Social Roles
    Chen, Po-Yen
    Zheng, Yu-Heng
    Althamary, Ibrahim
    Chern, Jann-Long
    Huang, Chih-Wei
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2293 - 2298