Reinforcement learning approach for resource allocation in humanitarian logistics

Cited by: 30

Authors:

Yu, Lina [1]

Zhang, Canrong [2]

Jiang, Jingyan [2]

Yang, Huasheng [3]

Shang, Huayan [1]

Affiliations:

[1] Capital Univ Econ & Business, Sch Management & Engn, Beijing 100070, Peoples R China

[2] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China

[3] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2021 / Vol. 173

Funding:

National Natural Science Foundation of China;

Keywords:

Humanitarian logistics; Resource allocation; Reinforcement learning; Q-learning; EMERGENCY RESPONSE; RELIEF DISTRIBUTION; QUICK RESPONSE; SUPPLY-CHAIN; MODEL; POLICIES;

DOI:

10.1016/j.eswa.2021.114663

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

When a disaster strikes, it is important to allocate limited disaster relief resources to those in need. This paper considers the allocation of resources in humanitarian logistics using three critical performance indicators: efficiency, effectiveness and equity. Three separate costs are considered to represent these metrics, namely, the accessibility-based delivery cost, the starting state-based deprivation cost, and the terminal penalty cost. A mixed-integer nonlinear programming model with multiple objectives and multiple periods is proposed. A Q-learning algorithm, a type of reinforcement learning method, is developed to address the complex optimization problem. The principles of the proposed algorithm, including the learning agent and its actions, the environment and its states, and reward functions, are presented in detail. The parameter settings of the proposed algorithm are also discussed in the experimental section. In addition, the solution quality of the proposed algorithm is compared with that of the exact dynamic programming method and a heuristic algorithm. The experimental results show that the efficiency of the algorithm is better than that of the dynamic programming method and the accuracy of the algorithm is higher than that of the heuristic algorithm. Moreover, the Q-learning algorithm provides near-optimal or even optimal solutions to the resource allocation problem when the number of training episodes K is adjusted in practical applications.
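To make the ingredients named in the abstract concrete (agent, actions, states, rewards, and the number of training episodes K), the following is a minimal sketch of tabular Q-learning on a toy allocation problem. It is not the authors' model: the two-area instance, the cost functions `deprivation_cost` and `terminal_penalty`, the constants, and the helpers `step`, `q_learning` and `greedy_plan` are all hypothetical and chosen only for illustration. In each period the agent serves one area; the reward is the negative of the delivery cost plus the deprivation cost of the starting state, with a terminal penalty added after the last period.

```python
import random
from collections import defaultdict

# Hypothetical toy instance (none of these numbers come from the paper).
DELIVERY_COST = [1.0, 3.0]   # assumed accessibility-based delivery cost per area
HORIZON = 4                  # number of planning periods
MAX_LEVEL = 4                # cap on the deprivation level of an area


def deprivation_cost(level):
    """Assumed convex cost of an area that has gone `level` periods unserved."""
    return float(level ** 2)


def terminal_penalty(state):
    """Assumed penalty on deprivation still unmet after the last period."""
    return 2.0 * sum(deprivation_cost(level) for level in state)


def step(state, action, t):
    """Serve area `action` in period `t`; return (next_state, reward, done)."""
    # The deprivation cost is charged on the state at the start of the period.
    cost = DELIVERY_COST[action] + sum(deprivation_cost(l) for l in state)
    next_state = tuple(
        0 if i == action else min(level + 1, MAX_LEVEL)
        for i, level in enumerate(state)
    )
    done = t + 1 == HORIZON
    if done:
        cost += terminal_penalty(next_state)
    return next_state, -cost, done


def q_learning(episodes, alpha=0.1, gamma=1.0, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration over `episodes` runs."""
    rng = random.Random(seed)
    n_actions = len(DELIVERY_COST)
    q = defaultdict(lambda: [0.0] * n_actions)  # key: (period, state)
    for _ in range(episodes):
        state = (1,) * n_actions
        for t in range(HORIZON):
            values = q[(t, state)]
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                action = max(range(n_actions), key=values.__getitem__)
            next_state, reward, done = step(state, action, t)
            future = 0.0 if done else gamma * max(q[(t + 1, next_state)])
            # Standard Q-learning update toward the bootstrapped target.
            values[action] += alpha * (reward + future - values[action])
            state = next_state
    return q


def greedy_plan(q):
    """Follow the learned Q-table greedily; return (plan, total cost)."""
    state, plan, total = (1,) * len(DELIVERY_COST), [], 0.0
    for t in range(HORIZON):
        values = q[(t, state)]
        action = max(range(len(values)), key=values.__getitem__)
        state, reward, _ = step(state, action, t)
        plan.append(action)
        total -= reward
    return plan, total
```

Calling `greedy_plan(q_learning(episodes=2000))` returns one allocation per period together with its total cost. In this sketch the `episodes` argument plays the role that the abstract assigns to K: more training episodes generally move the learned plan closer to the optimum at the price of longer training.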


Pages: 14

50 records in total

[1] Rollout algorithms for resource allocation in humanitarian logistics
Yu, Lina
Yang, Huasheng
Miao, Lixin
Zhang, Canrong
IISE TRANSACTIONS, 2019, 51 (08) : 887 - 909
[2] A reinforcement learning approach to dynamic resource allocation
Vengerov, David
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (03) : 383 - 390
[3] A reinforcement Learning approach to resource allocation in genomic selection
Moeinizade, Saba
Hu, Guiping
Wang, Lizhi
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 14
[4] A hybrid Reinforcement Learning approach to autonomic resource allocation
Tesauro, Gerald
Jong, Nicholas K.
Das, Rajarshi
Bennani, Mohamed N.
3RD INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2005, : 65 - 73
[5] Monopolistic Models for Resource Allocation: A Probabilistic Reinforcement Learning Approach
Zhang, Yue
Song, Bin
Gao, Su
Du, Xiaojiang
Guizani, Mohsen
IEEE ACCESS, 2018, 6 : 49721 - 49731
[6] Sustaining incentive in grid resource allocation: A reinforcement learning approach
Lin, Li
Zhang, Yu
Huai, Jinpeng
CCGrid 2007: Seventh IEEE International Symposium on Cluster Computing and the Grid, 2007, : 145 - 152
[7] Novel methods for resource allocation in humanitarian logistics considering human suffering
Yu, Lina
Zhang, Canrong
Yang, Huasheng
Miao, Lixin
COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 119 : 1 - 20
[8] Developing an Humanitarian Logistics Framework Using a Reinforcement Learning Technique
Alqumaizi, Khalid I.
Dutta, Ashit Kumar
Alshehri, Sultan
WORLD JOURNAL OF ENTREPRENEURSHIP MANAGEMENT AND SUSTAINABLE DEVELOPMENT, 2023, 19 (3-4) : 15 - 32
[9] OPTIMIZING HUMANITARIAN LOGISTICS WITH DEEP REINFORCEMENT LEARNING AND DIGITAL TWINS
Soykan, Bulent
Rabadia, Ghaith
2024 ANNUAL MODELING AND SIMULATION CONFERENCE, ANNSIM 2024, 2024,
[10] DHL: Deep reinforcement learning-based approach for emergency supply distribution in humanitarian logistics
Fan, Junchao
Chang, Xiaolin
Misic, Jelena
Misic, Vojislav B.
Kang, Hongyue
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2022, 15 (05) : 2376 - 2389
