Reinforcement learning approach for resource allocation in humanitarian logistics

被引:30
|
作者
Yu, Lina [1 ]
Zhang, Canrong [2 ]
Jiang, Jingyan [2 ]
Yang, Huasheng [3 ]
Shang, Huayan [1 ]
机构
[1] Capital Univ Econ & Business, Sch Management & Engn, Beijing 100070, Peoples R China
[2] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
[3] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Humanitarian logistics; Resource allocation; Reinforcement learning; Q-learning; EMERGENCY RESPONSE; RELIEF DISTRIBUTION; QUICK RESPONSE; SUPPLY-CHAIN; MODEL; POLICIES;
D O I
10.1016/j.eswa.2021.114663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When a disaster strikes, it is important to allocate limited disaster relief resources to those in need. This paper considers the allocation of resources in humanitarian logistics using three critical performance indicators: efficiency, effectiveness and equity. Three separate costs are considered to represent these metrics, namely, the accessibility-based delivery cost, the starting state-based deprivation cost, and the terminal penalty cost. A mixed-integer nonlinear programming model with multiple objectives and multiple periods is proposed. A Qlearning algorithm, a type of reinforcement learning method, is developed to address the complex optimization problem. The principles of the proposed algorithm, including the learning agent and its actions, the environment and its states, and reward functions, are presented in detail. The parameter settings of the proposed algorithm are also discussed in the experimental section. In addition, the solution quality of the proposed algorithm is compared with that of the exact dynamic programming method and a heuristic algorithm. The experimental results show that the efficiency of the algorithm is better than that of the dynamic programming method and the accuracy of the algorithm is higher than that of the heuristic algorithm. Moreover, the Q-learning algorithm provides close to or even optimal solutions to the resource allocation problem by adjusting the value of the training episode K in practical applications.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Rollout algorithms for resource allocation in humanitarian logistics
    Yu, Lina
    Yang, Huasheng
    Miao, Lixin
    Zhang, Canrong
    IISE TRANSACTIONS, 2019, 51 (08) : 887 - 909
  • [2] A reinforcement learning approach to dynamic resource allocation
    Vengerov, David
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (03) : 383 - 390
  • [3] A reinforcement Learning approach to resource allocation in genomic selection
    Moeinizade, Saba
    Hu, Guiping
    Wang, Lizhi
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
  • [4] A hybrid Reinforcement Learning approach to autonomic resource allocation
    Tesauro, Gerald
    Jong, Nicholas K.
    Das, Rajarshi
    Bennani, Mohamed N.
    3RD INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2005, : 65 - 73
  • [5] Monopolistic Models for Resource Allocation: A Probabilistic Reinforcement Learning Approach
    Zhang, Yue
    Song, Bin
    Gao, Su
    Du, Xiaojiang
    Guizani, Mohsen
    IEEE ACCESS, 2018, 6 : 49721 - 49731
  • [6] Sustaining incentive in grid resource allocation: A reinforcement learning approach
    Lin, Li
    Zhang, Yu
    Huai, Jinpeng
    CCGrid 2007: Seventh IEEE International Symposium on Cluster Computing and the Grid, 2007, : 145 - 152
  • [7] Novel methods for resource allocation in humanitarian logistics considering human suffering
    Yu, Lina
    Zhang, Canrong
    Yang, Huasheng
    Miao, Lixin
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 119 : 1 - 20
  • [8] Developing an Humanitarian Logistics Framework Using a Reinforcement Learning Technique
    Alqumaizi, Khalid I.
    Dutta, Ashit Kumar
    Alshehri, Sultan
    WORLD JOURNAL OF ENTREPRENEURSHIP MANAGEMENT AND SUSTAINABLE DEVELOPMENT, 2023, 19 (3-4) : 15 - 32
  • [9] OPTIMIZING HUMANITARIAN LOGISTICS WITH DEEP REINFORCEMENT LEARNING AND DIGITAL TWINS
    Soykan, Bulent
    Rabadia, Ghaith
    2024 ANNUAL MODELING AND SIMULATION CONFERENCE, ANNSIM 2024, 2024,
  • [10] DHL: Deep reinforcement learning-based approach for emergency supply distribution in humanitarian logistics
    Fan, Junchao
    Chang, Xiaolin
    Misic, Jelena
    Misic, Vojislav B.
    Kang, Hongyue
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (05) : 2376 - 2389