Deep reinforcement learning for solving steelmaking-continuous casting scheduling problems under time-of-use tariffs

被引:8
|
作者
Pan, Ruilin [1 ,2 ]
Wang, Qiong [1 ]
Cao, Jianhua [1 ,2 ,3 ]
Zhou, Chunliu [1 ,2 ]
机构
[1] Anhui Univ Technol, Sch Management Sci & Engn, Maanshan, Peoples R China
[2] Anhui Univ Technol, Anhui Higher Educ Inst, Key Lab Multidisciplinary Management & Control Com, Maanshan, Peoples R China
[3] Anhui Univ Technol, Xiushan Campus,Maxiang Rd, Maanshan 24032, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Steelmaking-continuous casting; scheduling; deep reinforcement learning; time-of-use tariffs; multi-objective optimisation; FLOW-SHOP; SINGLE-MACHINE; OPTIMIZATION; CONSUMPTION; COST;
D O I
10.1080/00207543.2023.2267693
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper proposes a novel intelligent scheduling method based on deep reinforcement learning (DRL) to solve the multi-objective steelmaking-continuous casting (SCC) scheduling problem, under time-of-use (TOU) tariffs for the first time. The intelligent scheduling system architecture is designed, and a mathematical model is established to minimise the total sojourn time and electricity cost. To effectively reduce production costs by avoiding peak periods of electricity consumption, the 'start time' of the system is generated based on the Markov Decision Process (MDP), and heuristic scheduling rules related to power cost are used as the action space, with corresponding reward functions designed according to the characteristics of these two objectives. To satisfy the continuous casting which is a particular SCC constraint, a backward strategy is developed. Additionally, a branching duelling double deep Q-network (BD3QN) is adapted to guide action selection and avoid blind search in the iteration process, and then applied to real-time scheduling. Numerical experiments demonstrate that the proposed method outperforms comparison algorithms in terms of solution quality and CPU times by a large margin.
引用
收藏
页码:404 / 420
页数:17
相关论文
共 50 条
  • [41] An inverse reinforcement learning algorithm with population evolution mechanism for the multi-objective flexible job-shop scheduling problem under time-of-use electricity tariffs
    Zhao, Fuqing
    Wang, Weiyuan
    Zhu, Ningning
    Xu, Tianpeng
    APPLIED SOFT COMPUTING, 2025, 170
  • [42] Two-stage parallel speed-scaling machine scheduling under time-of-use tariffs
    Hongliang Zhang
    Yujuan Wu
    Ruilin Pan
    Gongjie Xu
    Journal of Intelligent Manufacturing, 2021, 32 : 91 - 112
  • [43] Solving flexible job shop scheduling problems via deep reinforcement learning
    Yuan, Erdong
    Wang, Liejun
    Cheng, Shuli
    Song, Shiji
    Fan, Wei
    Li, Yongming
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [44] A TWO-STAGE GREEDY HEURISTIC FOR A FLOWSHOP SCHEDULING PROBLEM UNDER TIME-OF-USE ELECTRICITY TARIFFS
    Pilerood, A. E.
    Heydari, M.
    Mazdeh, M. M.
    SOUTH AFRICAN JOURNAL OF INDUSTRIAL ENGINEERING, 2018, 29 (01): : 143 - 154
  • [45] Leveraging Transfer Learning in Deep Reinforcement Learning for Solving Combinatorial Optimization Problems Under Uncertainty
    Ezzahra Achamrah, Fatima
    IEEE ACCESS, 2024, 12 : 181477 - 181497
  • [46] Research on Steelmaking-Continuous Casting Production Scheduling Problem Based on Augmented Lagrangian Relaxation Algorithm under Multi-Coupling Constraints
    Sun, Liangliang
    Jin, Hang
    Yu, Yaqian
    Li, Zhi
    Xi, Jiali
    IFAC PAPERSONLINE, 2019, 52 (01): : 820 - 825
  • [47] Energy-efficient scheduling in an unrelated parallel-machine environment under time-of-use electricity tariffs
    Saberi-Aliabad, Hossein
    Reisi-Nafchi, Mohammad
    Moslehi, Ghasem
    JOURNAL OF CLEANER PRODUCTION, 2020, 249 (249)
  • [48] An energy-efficient collaborative strategy of maintenance planning and production scheduling for serial-parallel systems under time-of-use tariffs
    An, Xiangxin
    Si, Guojin
    Xia, Tangbin
    Wang, Dong
    Pan, Ershun
    Xi, Lifeng
    APPLIED ENERGY, 2023, 336
  • [49] Single-machine batch scheduling under time-of-use tariffs: new mixed-integer programming approaches
    Cheng, Junheng
    Chu, Feng
    Liu, Ming
    Xia, Weili
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 3498 - 3503
  • [50] Solving task scheduling problems in cloud manufacturing via attention mechanism and deep reinforcement learning
    Wang, Xiaohan
    Zhang, Lin
    Liu, Yongkui
    Zhao, Chun
    Wang, Kunyu
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 65 : 452 - 468