A new resource-constrained project scheduling problem with ladder-type carbon trading prices and its algorithm based on deep reinforcement learning

Cited by: 0

Authors:

Liu, Hao [1]

Zhang, Jingwen [1]

Zhang, Xinyue [1]

Chen, Zhi [1]

Affiliations:

[1] Northwestern Polytech Univ, Sch Management, Xian 710072, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2024 / Vol. 255

Funding:

National Natural Science Foundation of China;

Keywords:

Ladder-type carbon trading prices; Limited construction site; RCPSP; Deep reinforcement learning; Tabu search; TEAM;

DOI:

10.1016/j.eswa.2024.124794

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Carbon trading aims to reduce emissions through markets and has been adopted by many countries. This study proposes a new resource-constrained project scheduling problem with ladder-type carbon trading prices (RCPSP-LCTP). The objective is to minimize the total cost, including the carbon trading cost. We develop a two-stage algorithm called MDDQN-TS to solve the RCPSP-LCTP. First, a multi-step double deep Q-network (MDDQN) with a modified convolutional neural network is trained on small-sized instances to learn the optimal scheduling policy. The learned policy is used to solve instances of various sizes. Second, a tabu search (TS) algorithm is used to further improve the solution obtained by the policy. Experimental results show that MDDQN-TS outperforms both the genetic algorithm (GA) and TS, particularly on large-sized instances. In terms of convergence speed, MDDQN-TS is the fastest, followed by TS, while GA converges the slowest. Specifically, the number of schedules required for MDDQN-TS to converge is only 20.3% to 37.9% of that required by TS. The experimental results also prove that ladder-type carbon trading prices can reduce carbon emissions more effectively than fixed prices.
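
The abstract names a multi-step double deep Q-network but does not give its update rule. As a rough illustration only, the sketch below computes the textbook n-step double-DQN target: a discounted sum of n observed rewards, plus a bootstrap value in which the online network picks the action and the target network scores it. The function name, argument names, and default discount factor are hypothetical and are not taken from the paper, whose reward design and network details may differ.

```python
from typing import Sequence


def n_step_double_dqn_target(
    rewards: Sequence[float],        # r_t, ..., r_{t+n-1} (hypothetical input)
    q_online_next: Sequence[float],  # online-network Q-values at state s_{t+n}
    q_target_next: Sequence[float],  # target-network Q-values at state s_{t+n}
    gamma: float = 0.99,             # discount factor (assumed value)
    terminal: bool = False,          # True if s_{t+n} ends the episode
) -> float:
    """Textbook n-step return bootstrapped with a double-DQN estimate."""
    n = len(rewards)
    # Discounted sum of the n observed rewards.
    n_step_return = sum((gamma ** k) * r for k, r in enumerate(rewards))
    if terminal:
        # No bootstrap term once the episode has ended.
        return n_step_return
    # Double DQN: the online network selects the action,
    # the target network evaluates it.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return n_step_return + (gamma ** n) * q_target_next[best_action]
```

Separating action selection from action evaluation is the standard way double DQN reduces the overestimation bias of plain Q-learning; using several reward steps before bootstrapping lets reward information propagate faster.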

Pages: 15
