Deep reinforcement learning for solving steelmaking-continuous casting scheduling problems under time-of-use tariffs

Cited by: 8

Authors:

Pan, Ruilin [1,2]

Wang, Qiong [1]

Cao, Jianhua [1,2,3]

Zhou, Chunliu [1,2]

Affiliations:

[1] Anhui Univ Technol, Sch Management Sci & Engn, Maanshan, Peoples R China

[2] Anhui Univ Technol, Anhui Higher Educ Inst, Key Lab Multidisciplinary Management & Control Com, Maanshan, Peoples R China

[3] Anhui Univ Technol, Xiushan Campus,Maxiang Rd, Maanshan 24032, Anhui, Peoples R China

Source:

INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH | 2024 / Vol. 62 / Issue 1-2

Funding:

National Natural Science Foundation of China;

Keywords:

Steelmaking-continuous casting; scheduling; deep reinforcement learning; time-of-use tariffs; multi-objective optimisation; FLOW-SHOP; SINGLE-MACHINE; OPTIMIZATION; CONSUMPTION; COST;

DOI:

10.1080/00207543.2023.2267693

Chinese Library Classification:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

This paper proposes a novel intelligent scheduling method based on deep reinforcement learning (DRL) to solve the multi-objective steelmaking-continuous casting (SCC) scheduling problem, under time-of-use (TOU) tariffs for the first time. The intelligent scheduling system architecture is designed, and a mathematical model is established to minimise the total sojourn time and electricity cost. To effectively reduce production costs by avoiding peak periods of electricity consumption, the 'start time' of the system is generated based on the Markov Decision Process (MDP), and heuristic scheduling rules related to power cost are used as the action space, with corresponding reward functions designed according to the characteristics of these two objectives. To satisfy the continuous casting which is a particular SCC constraint, a backward strategy is developed. Additionally, a branching duelling double deep Q-network (BD3QN) is adapted to guide action selection and avoid blind search in the iteration process, and then applied to real-time scheduling. Numerical experiments demonstrate that the proposed method outperforms comparison algorithms in terms of solution quality and CPU times by a large margin.

Pages: 404-420

Page count: 17

Related records: 50 in total

[41] An inverse reinforcement learning algorithm with population evolution mechanism for the multi-objective flexible job-shop scheduling problem under time-of-use electricity tariffs
Zhao, Fuqing
Wang, Weiyuan
Zhu, Ningning
Xu, Tianpeng
APPLIED SOFT COMPUTING, 2025, 170
[42] Two-stage parallel speed-scaling machine scheduling under time-of-use tariffs
Hongliang Zhang
Yujuan Wu
Ruilin Pan
Gongjie Xu
Journal of Intelligent Manufacturing, 2021, 32 : 91 - 112
[43] Solving flexible job shop scheduling problems via deep reinforcement learning
Yuan, Erdong
Wang, Liejun
Cheng, Shuli
Song, Shiji
Fan, Wei
Li, Yongming
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
[44] A TWO-STAGE GREEDY HEURISTIC FOR A FLOWSHOP SCHEDULING PROBLEM UNDER TIME-OF-USE ELECTRICITY TARIFFS
Pilerood, A. E.
Heydari, M.
Mazdeh, M. M.
SOUTH AFRICAN JOURNAL OF INDUSTRIAL ENGINEERING, 2018, 29 (01): 143 - 154
[45] Leveraging Transfer Learning in Deep Reinforcement Learning for Solving Combinatorial Optimization Problems Under Uncertainty
Ezzahra Achamrah, Fatima
IEEE ACCESS, 2024, 12 : 181477 - 181497
[46] Research on Steelmaking-Continuous Casting Production Scheduling Problem Based on Augmented Lagrangian Relaxation Algorithm under Multi-Coupling Constraints
Sun, Liangliang
Jin, Hang
Yu, Yaqian
Li, Zhi
Xi, Jiali
IFAC PAPERSONLINE, 2019, 52 (01): 820 - 825
[47] Energy-efficient scheduling in an unrelated parallel-machine environment under time-of-use electricity tariffs
Saberi-Aliabad, Hossein
Reisi-Nafchi, Mohammad
Moslehi, Ghasem
JOURNAL OF CLEANER PRODUCTION, 2020, 249 (249)
[48] An energy-efficient collaborative strategy of maintenance planning and production scheduling for serial-parallel systems under time-of-use tariffs
An, Xiangxin
Si, Guojin
Xia, Tangbin
Wang, Dong
Pan, Ershun
Xi, Lifeng
APPLIED ENERGY, 2023, 336
[49] Single-machine batch scheduling under time-of-use tariffs: new mixed-integer programming approaches
Cheng, Junheng
Chu, Feng
Liu, Ming
Xia, Weili
2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016: 3498 - 3503
[50] Solving task scheduling problems in cloud manufacturing via attention mechanism and deep reinforcement learning
Wang, Xiaohan
Zhang, Lin
Liu, Yongkui
Zhao, Chun
Wang, Kunyu
JOURNAL OF MANUFACTURING SYSTEMS, 2022, 65 : 452 - 468