A new resource-constrained project scheduling problem with ladder-type carbon trading prices and its algorithm based on deep reinforcement learning

被引:0
作者
Liu, Hao [1 ]
Zhang, Jingwen [1 ]
Zhang, Xinyue [1 ]
Chen, Zhi [1 ]
机构
[1] Northwestern Polytech Univ, Sch Management, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Ladder-type carbon trading prices; Limited construction site; RCPSP; Deep reinforcement learning; Tabu search; TEAM;
D O I
10.1016/j.eswa.2024.124794
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Carbon trading aims to reduce emissions through markets and has been adopted by many countries. This study proposes a new resource-constrained project scheduling problem with ladder-type carbon trading prices (RCPSPLCTP). The objective is to minimize the total cost, including the carbon trading cost. We develop a two-stage algorithm called MDDQN-TS to solve the RCPSP-LCTP. First, a multi-step double deep Q-network (MDDQN) with a modified convolutional neural network is trained on small-sized instances to learn the optimal scheduling policy. The learned policy is used to solve instances of various sizes. Second, a tabu search (TS) algorithm is used to further improve the solution obtained by the policy. Experimental results show that MDDQN-TS outperforms both the genetic algorithm (GA) and TS, particularly on large-sized instances. In terms of convergence speed, the MDDQN-TS algorithm demonstrates the fastest performance, followed by the TS algorithm, while GA exhibits the slowest convergence. Specifically, the number of schedules required for MDDQN-TS to converge is only 20.3 % similar to 37.9 % of TS. The experimental results also prove that ladder-type carbon trading prices can reduce carbon emissions more effectively than fixed prices.
引用
收藏
页数:15
相关论文
共 43 条
  • [1] Operational level emissions modelling of on-road construction equipment through field data analysis
    Barati, Khalegh
    Shen, Xuesong
    [J]. AUTOMATION IN CONSTRUCTION, 2016, 72 : 338 - 346
  • [2] Dynamic stochastic electric vehicle routing with safe reinforcement learning
    Basso, Rafael
    Kulcsar, Balazs
    Sanchez-Diaz, Ivan
    Qu, Xiaobo
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2022, 157
  • [3] Beijing Municipal Ecology and Environment Bureau, 2023, Notice on announcing the list of key carbon emission units and general reporting units in Beijing in 2022
  • [4] SCHEDULING SUBJECT TO RESOURCE CONSTRAINTS - CLASSIFICATION AND COMPLEXITY
    BLAZEWICZ, J
    LENSTRA, JK
    KAN, AHGR
    [J]. DISCRETE APPLIED MATHEMATICS, 1983, 5 (01) : 11 - 24
  • [5] A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone
    Bogyrbayeva, Aigerim
    Yoon, Taehyun
    Ko, Hanbum
    Lim, Sungbin
    Yun, Hyokun
    Kwon, Changhyun
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 148
  • [6] China association of building energy efficiency, 2022, Research report of China building energy consumption and carbon emissions
  • [7] Reinforcement learning-based dynamic planning of cut and fill operations for earthwork optimization
    Choi, Gwan
    Han, SangUk
    [J]. AUTOMATION IN CONSTRUCTION, 2023, 156
  • [8] Energy-cost-aware resource-constrained project scheduling for complex product system with activity splitting and recombining
    Du, Baigang
    Tan, Tian
    Guo, Jun
    Li, Yibing
    Guo, Shunsheng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [9] Identification and resolution of work space conflicts in building construction
    Guo, SJ
    [J]. JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2002, 128 (04) : 287 - 295
  • [10] Guo X, 2004, Road Machinery & Construction Mechanization., V21, P1, DOI [10.3969/j.issn.1000-033X.2004.10.001, DOI 10.3969/J.ISSN.1000-033X.2004.10.001]