Large-scale dynamic surgical scheduling under uncertainty by hierarchical reinforcement learning

被引:0
作者
Zhao, Lixiang [1 ,2 ]
Zhu, Han [1 ,2 ]
Zhang, Min [1 ,2 ]
Tang, Jiafu [1 ,2 ]
Wang, Yu [3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Dongbei Univ Finance & Econ, Key Lab Liaoning Prov Data Anal & Optimizat Decis, Dalian 116025, Peoples R China
[3] Northeastern Univ, Sch Business Adm, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
SDG3: good health and well-being; dynamic surgical scheduling; hierarchical reinforcement learning; Markov decision process; dynamic unrelated parallel machine scheduling; MEAN WEIGHTED TARDINESS; ALGORITHMS; SURGERIES; DEMAND;
D O I
10.1080/00207543.2024.2361449
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Dynamic surgical scheduling within a workday is a complicated decision-making process. The critical challenge is that the actual duration of surgery and the arrival process of emergency patients are uncertain and unknown in advance. In this work, we propose a two-level dynamic scheduling framework based on hierarchical reinforcement learning to solve dynamic surgical scheduling problems considering both elective and emergency patients. Specifically, with the realisation of uncertainty, the upper-level agent (UA) dynamically decides whether to trigger rescheduling to optimise the workday total cost. The lower-level agent (LA) aims at obtaining subscheduling solutions when rescheduling is triggered. The subproblem at the LA can be formulated as a mixed integer programming model, which can be generalised to unrelated parallel machine scheduling with machine eligibility restrictions and sequence- and machine-dependent setup times. Such problems can be solved in small-scale cases and suffers the combinatorial explosion in large scale cases. To address this issue, we propose a heuristic method that is built upon deep reinforcement learning to obtain high-quality solutions. We conduct extensive simulation experiments with real data to test the effective of our framework. The results for different scenarios show that our proposed framework outperforms existing methods in terms of overall performance and has strong generalisation ability.
引用
收藏
页数:32
相关论文
共 61 条
  • [1] Improving operational effectiveness of tactical master plans for emergency and elective patients under stochastic demand and capacitated resources
    Adan, Ivo
    Bekkers, Jos
    Dellaert, Nico
    Jeunet, Jully
    Vissers, Jan
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 213 (01) : 290 - 308
  • [2] Dynamic Insertion of Emergency Surgeries With Different Waiting Time Targets
    Bargetto, Roberto
    Garaix, Thierry
    Xie, Xiaolan
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2019, 16 (01) : 87 - 99
  • [3] Recent Advances in Hierarchical Reinforcement Learning
    Andrew G. Barto
    Sridhar Mahadevan
    [J]. Discrete Event Dynamic Systems, 2003, 13 (4) : 341 - 379
  • [4] Machine learning for combinatorial optimization: A methodological tour d'horizon
    Bengio, Yoshua
    Lodi, Andrea
    Prouvost, Antoine
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2021, 290 (02) : 405 - 421
  • [5] Operating room planning and scheduling: A literature review
    Cardoen, Brecht
    Demeulemeester, Erik
    Belien, Jeroen
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (03) : 921 - 932
  • [6] Csaji B. C., 2005, IFAC P VOLUMES, V38, P355, DOI [10.3182/20050703-6-CZ-1902.01481, DOI 10.3182/20050703-6-CZ-1902.01481]
  • [7] Heuristic and metaheuristic methods for the parallel unrelated machines scheduling problem: a survey
    Durasevic, Marko
    Jakobovic, Domagoj
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) : 3181 - 3289
  • [8] Rescheduling of elective patients upon the arrival of emergency patients
    Erdem, Ergin
    Qu, Xiuli
    Shi, Jing
    [J]. DECISION SUPPORT SYSTEMS, 2012, 54 (01) : 551 - 563
  • [9] Machine learning based integrated scheduling and rescheduling for elective and emergency patients in the operating theatre
    Eshghali, Masoud
    Kannan, Devika
    Salmanzadeh-Meydani, Navid
    Sikaroudi, Amir Mohammad Esmaieeli
    [J]. ANNALS OF OPERATIONS RESEARCH, 2024, 332 (1-3) : 989 - 1012
  • [10] Computation offloading in mobile edge computing networks: A survey
    Feng, Chuan
    Han, Pengchao
    Zhang, Xu
    Yang, Bowen
    Liu, Yejun
    Guo, Lei
    [J]. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2022, 202