Towards optimal control of HPV model using safe reinforcement learning with actor-critic neural networks

被引:0
作者
Amirabadi, Roya Khalili [1 ]
Fard, Omid S. [1 ]
Farimani, Mohsen Jalaeian [2 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Appl Math, Mashhad, Iran
[2] Politecn Milan, Dept Elect Informat & Bioengn, Milan, Italy
关键词
Optimal control; Reinforcement learning; Experience replay; Safety; Actor-critic neural network; Control barrier function; Nonlinear systems; HPV model; HUMAN-PAPILLOMAVIRUS; VACCINE; TRANSMISSION; IMPACT;
D O I
10.1016/j.eswa.2024.125783
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel approach that applies state-of-the-art concepts in reinforcement learning (RL) to the optimal control of human papillomavirus (HPV) infection. The methodology transforms the nonlinear optimal control problem into a constrained nonlinear programming problem, thus allowing effective application of the RL algorithms. This approach combines Hamilton-Jacobi-Bellman (HJB) equations with actor-critic neural networks and control barrier functions to obtain an adaptive strategy for optimal vaccination and screening against HPV infection. A key innovation is the Sophia optimizer with experience replay, addressing the critical need for online data application in infectious disease control. Unlike the traditional methods that rely on the accumulation of extensive data, this approach utilizes experience replay to learn and adapt continuously, hence giving practical solutions for diseases like HPV where waiting for data is not practical or desirable. Experience replay helps to store and reuse past experience, hence improving the learning efficiency and stability of the system. This is an important feature for online applications to make sure that an RL model responds quickly enough to changing epidemiological conditions. Numerical simulations demonstrate the effectiveness of this approach in minimizing HPV prevalence and optimizing resource allocation. This research offers significant insights into the application of advanced control strategies in infectious disease management, highlighting the potential of RL to address complex epidemiological challenges. The ability to apply these techniques to online underscores the importance of adaptive and responsive strategies in public health.
引用
收藏
页数:24
相关论文
共 23 条
[1]   Combining hybrid metaheuristic algorithms and reinforcement learning to improve the optimal control of nonlinear continuous-time systems with input constraints [J].
Amirabadi, Roya Khalili ;
Fard, Omid Solaymani .
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
[2]   Population-level impact, herd immunity, and elimination after human papillomavirus vaccination: a systematic review and meta-analysis of predictions from transmission-dynamic models [J].
Brisson, Marc ;
Benard, Elodie ;
Drolet, Melanie ;
Bogaards, Johannes A. ;
Baussano, Iacopo ;
Vanska, Simopekka ;
Jit, Mark ;
Boily, Marie-Claude ;
Smith, Megan A. ;
Berkhof, Johannes ;
Canfell, Karen ;
Chesson, Harrell W. ;
Burger, Emily A. ;
Choi, Yoon H. ;
De Blasio, Birgitte Freiesleben ;
De Vlas, Sake J. ;
Guzzetta, Giorgio ;
Hontelez, Jan A. C. ;
Horn, Johannes ;
Jepsen, Martin R. ;
Kim, Jane J. ;
Lazzarato, Fulvio ;
Matthijsse, Suzette M. ;
Mikolajczyk, Rafael ;
Pavelyev, Andrew ;
Pillsbury, Matthew ;
Shafer, Leigh Anne ;
Tully, Stephen P. ;
Turner, Hugo C. ;
Usher, Cara ;
Walsh, Cathal .
LANCET PUBLIC HEALTH, 2016, 1 (01) :E8-E17
[3]   Looking beyond human papillomavirus (HPV) genotype 16 and 18: Defining HPV genotype distribution in cervical cancers in Australia prior to vaccination [J].
Brotherton, Julia M. L. ;
Tabrizi, Sepehr N. ;
Phillips, Samuel ;
Pyman, Jan ;
Cornall, Alyssa M. ;
Lambie, Neil ;
Anderson, Lyndal ;
Cummings, Margaret ;
Payton, Diane ;
Scurry, James P. ;
Newman, Marsali ;
Sharma, Raghwa ;
Saville, Marion ;
Garland, Suzanne M. .
INTERNATIONAL JOURNAL OF CANCER, 2017, 141 (08) :1576-1584
[4]   The role of optimal control in assessing the most cost-effective implementation of a vaccination programme: HPV as a case study [J].
Brown, V. L. ;
White, K. A. Jane .
MATHEMATICAL BIOSCIENCES, 2011, 231 (02) :126-134
[5]   Tabu Search applied to global optimization [J].
Chelouah, R ;
Siarry, P .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2000, 123 (02) :256-270
[6]   Population-level impact and herd effects following the introduction of human papillomavirus vaccination programmes: updated systematic review and meta-analysis [J].
Drolet, Melanie ;
Benard, Elodie ;
Perez, Norma ;
Brisson, Marc ;
Boily, Marie-Claude ;
Ali, Hammad ;
Baldo, Vincenzo ;
Brassard, Paul ;
Brotherton, Julia M. L. ;
Callander, Denton ;
Checchi, Marta ;
Chow, Eric P. F. ;
Cocchio, Silvia ;
Dalianis, Tina ;
Deeks, Shelley L. ;
Dehlendorff, Christian ;
Donovan, Basil ;
Fairley, Christopher K. ;
Flagg, Elaine W. ;
Gargano, Julia W. ;
Garland, Suzanne M. ;
Grun, Nathalie ;
Hansen, Bo T. ;
Harrison, Christopher ;
Herweijer, Eva ;
Imburgia, Teresa M. ;
Johnson, Anne M. ;
Kahn, Jessica A. ;
Kavanagh, Kimberley ;
Kjaer, Susanne K. ;
Kliewer, Erich V. ;
Liu, Bette ;
Machalek, Dorothy A. ;
Markowitz, Lauri ;
Mesher, David ;
Munk, Christian ;
Niccolai, Linda ;
Nygard, Mari ;
Ogilvie, Gina ;
Oliphant, Jeannie ;
Pollock, Kevin G. ;
Purrinos-Hermida, Maria Jesus ;
Smith, Megan A. ;
Steben, Marc ;
Soderlund-Strand, Anna ;
Sonnenberg, Pam ;
Sparen, Par ;
Tanton, Clare ;
Wheeler, Cosette M. ;
Woestenberg, Petra J. .
LANCET, 2019, 394 (10197) :497-509
[7]  
Huang Y, 2022, PR MACH LEARN RES, V182, P631
[8]   Final efficacy, immunogenicity, and safety analyses of a nine-valent human papillomavirus vaccine in women aged 16-26 years: a randomised, double-blind trial [J].
Huh, Warner K. ;
Joura, Elmar A. ;
Giuliano, Anna R. ;
Iversen, Ole-Erik ;
de Andrade, Rosires Pereira ;
Ault, Kevin A. ;
Bartholomew, Deborah ;
Cestero, Ramon M. ;
Fedrizzi, Edison N. ;
Hirschberg, Angelica L. ;
Mayrand, Marie-Helene ;
Ruiz-Sternberg, Angela Maria ;
Stapleton, Jack T. ;
Wiley, Dorothy J. ;
Ferenczy, Alex ;
Kurman, Robert ;
Ronnett, Brigitte M. ;
Stoler, Mark H. ;
Cuzick, Jack ;
Garland, Suzanne M. ;
Kjaer, Susanne K. ;
Bautista, Oliver M. ;
Haupt, Richard ;
Moeller, Erin ;
Ritter, Michael ;
Roberts, Christine C. ;
Shields, Christine ;
Luxembourg, Alain .
LANCET, 2017, 390 (10108) :2143-2159
[9]   Bi-Level Adaptive Computed-Current Impedance Controller for Electrically Driven Robots [J].
Jalaeian-F, Mohsen ;
Fateh, Mohammad Mehdi ;
Rahimiyan, Morteza .
ROBOTICA, 2021, 39 (02) :200-216
[10]   A 9-Valent HPV Vaccine against Infection and Intraepithelial Neoplasia in Women [J].
Joura, E. A. ;
Giuliano, A. R. ;
Iversen, O-E ;
Bouchard, C. ;
Mao, C. ;
Mehlsen, J. ;
Moreira, E. D., Jr. ;
Ngan, Y. ;
Petersen, L. K. ;
Lazcano-Ponce, E. ;
Pitisuttithum, P. ;
Restrepo, J. A. ;
Stuart, G. ;
Woelber, L. ;
Yang, Y. C. ;
Cuzick, J. ;
Garland, S. M. ;
Huh, W. ;
Kjaer, S. K. ;
Bautista, O. M. ;
Chan, I. S. F. ;
Chen, J. ;
Gesser, R. ;
Moeller, E. ;
Ritter, M. ;
Vuocolo, S. ;
Luxembourg, A. .
NEW ENGLAND JOURNAL OF MEDICINE, 2015, 372 (08) :711-723